Thermal Buckling of Dual-Coated Fiber


By T. A. LENAHAN* 
(Manuscript received August 8, 1984) 


An analysis of buckling is presented for dual-coated fibers within their 
coating at low temperatures. Buckling causes microbending of the fiber axis, 
the prime source of added optical loss in the fiber. Buckling is caused by 
compressive stress exerted on the fiber by the coating, which arises because 
the thermal expansion coefficient of the coating is substantially larger than 
that of the fiber. Calculations show that buckling is more likely when the 
inner primary coating layer is soft and thick. Previous experimental results 
on fibers in cables indicate that softer, thicker primaries lead to more added 
loss at low temperatures, contrary to the simple model where lateral pressure 
imprints irregularities onto the path of the fiber. Such is the evidence that 
buckling occurs. The theory is applied to various coating designs. The calcu- 
lated results rank the low-temperature performance reliably, but they also 
indicate that thermal strain falls short of the buckling strain. Other sources 
of strain are suggested, including a mechanism whereby irregularities induced 
by lateral pressure on the outside produce bending moments on the fiber. 


I. INTRODUCTION 


Optical fibers put in cables and/or subjected to low temperatures
can have added transmission loss, attributed to microbending of the
fiber axis (Ref. 1). Microbending can result from increased lateral
pressure inside the cable, which imprints irregularities there onto the
path of the fiber (Ref. 2). The imprinting is accentuated when the
materials are stiffer, as at lower temperatures. Dual coatings (see
Fig. 1) were introduced to reduce the effects of lateral pressure by
buffering the fiber with a soft inner primary layer; the outer secondary
layer is hard and robust to allow handling. In practice, dual-coated
fiber does indeed show substantially less added loss than single-coated
fiber (Ref. 3).

Fig. 1. A dual-coated fiber.

Microbending also can arise from buckling of the fiber. Coating 
materials have thermal expansion coefficients two to three orders of 
magnitude larger than that of the fiber; hence, the coating exerts a 
compressive stress on the fiber at low temperatures. When the thermal 
strain exceeds a critical limit, the fiber buckles within the primary 
coating. Even below the buckling limit, thermal strain may amplify 
fiber bending already present. Buckling has been directly observed in
(1) single-coated fiber within a soft elastomer under forced bending
(Ref. 4) and (2) dual-coated resistance wire under thermal stress
(Ref. 5).

Indirect evidence exists for buckling of dual-coated fiber. Yabuta et
al. report that thicker primary coatings lead to increased added loss
at -50°C, irrespective of the outer secondary thickness, even though
the primary material (silicone) remains soft at -50°C (see Fig. 7 of
Ref. 6). This finding is not consistent with the simple lateral-pressure
model, which says that more buffering gives less microbending. It is
consistent with the buckling model because, as will be shown, thicker
and softer primary layers provide less resistance to lateral
displacement of the fiber and hence to fiber buckling. Results of the
so-called S5 experiment (Ref. 7), involving various coating designs,
likewise indicate that softer, thicker primaries lead to more added loss
at low temperatures.

In this paper thermal buckling of a dual-coated fiber within its
coating is analyzed. Thermal strain on the fiber is calculated
numerically and found to agree reasonably well with the simple rule of
mixtures. Buckling strain, the minimum strain required for the fiber
to buckle, depends mainly on the lateral rigidity of the fiber in its
coating and is measured by a spring constant κ, which is also calculated
numerically. The buckling analysis of Refs. 5 and 6 was flawed because
these references incorrectly assumed that κ was the modulus of the
primary.

The theory is applied to various designs including ones of Yabuta 
and ones used in the S5 experiment. Results indicate, as above, that 
the fiber is more likely to buckle when the primary coating is soft and 
thick. The calculated thermal strain by itself is less than the calculated 
buckling strain in all cases, but other sources of strain can combine 
with the thermal to reach the buckling strain. For instance, thermal 
stress combined with irregularities in the secondary caused by lateral 
pressure would put moments on the fiber. 

Whatever the exact mechanism for microbending, coating designs 
should account for the possibility of buckling. Experimental results 
indicate that the primary should be sufficiently thin, especially if the 
primary material remains soft at low temperatures. 

In the next section the buckling analysis is developed and the 
associated numerical calculations are indicated. In Section III the 
theory is applied to the resistance wire of Katsuyama et al. (Ref. 5),
to other silicone/nylon coatings of Yabuta et al. (Ref. 6), and to
designs from the S5 experiment (Ref. 7). The paper is summarized in
Section IV and certain
aspects outside the model that favor buckling are discussed. One of 
the most important aspects, already mentioned, is the prospect that 
irregular lateral pressure in conjunction with thermal stress produces 
bending moments on the fiber. 


II. ANALYSIS


The analysis of fiber buckling involves the force or strain needed to 
buckle the fiber and the force actually on the fiber. In this section, 
after the mechanical properties of the relevant materials are indicated, 
the thermal strain of the fiber is calculated, and then the buckling 
strain is calculated. The fiber and coating are assumed throughout to 
be perfectly circular and concentric, homogeneous, and uniform along 
their length. 


2.1 Materials characterization 


Two kinds of structures are considered: dual-coated optical fiber
and the dual-coated wire of Katsuyama. The dual-coated optical fibers
considered have outer secondaries that are either an ultraviolet
(UV)-cured urethane acrylate material (Borden) or nylon. The Borden
secondary has as its inner primary either UV-cured material supplied
by De Soto (Desolite*) or a thermoplastic material (Hot Melt). The
nylon secondary has silicone as its primary material; this combination
is used for the wire of Katsuyama et al. (Ref. 5) and for optical fiber
studied by Yabuta et al. (Ref. 6).

Table I. Material parameters

                      T = -40°C            T = 0°C
            E_g       E                    E                    α
Material    (psi)     (psi)     ν          (psi)     ν          (°C^-1)
Glass       1.07E7    1.07E7    0.25       1.07E7    0.25       5.04E-7
Wire        2.35E7    2.35E7    0.25       2.35E7    0.25       1.6E-5
Desolite    4.60E5    9.15E3    0.495      2.47E2    0.49987    1.2E-4
Hot Melt    -         3.34E2    0.49982    9.15E1    0.49995    1.4E-4
Silicone    -         2.64E2    0.49986    1.47E2    0.49992    3.0E-4
Borden      6.48E5    5.15E5    0.30       1.96E5    0.4244     0.6E-4
Nylon       -         3.08E5    0.333      2.93E5    0.341      1.0E-4

The modulus E, Poisson ratio ν, and thermal expansion coefficient
α are the mechanical properties needed in the buckling analysis. The
glass fiber and wire are elastic in the temperature range of interest;
hence, a single value for each mechanical parameter characterizes
these materials. The coatings are viscoelastic polymeric materials;
their modulus values relax over time and depend on temperature. We
simplify the modulus values here by considering only their values at
24 hours (a typical time span for temperature drops from room
temperature to -40°C).

The values of E and α for the fiber are taken from Table III of Ref.
8 and for the wire from Ref. 5. Their Poisson ratio is taken as ν =
0.25, the accepted value for a material in its glassy (stiff) state. These
values appear in Table I.

The 24-hour moduli E of Desolite, Hot Melt, and Borden are
shown as functions of temperature in Figs. 2 through 4, respectively.
These moduli were synthesized from oscillatory data from a rheometric
thermal mechanical spectrometer. Modulus values for 0 and -40°C
appear in Table I. The thermal expansion coefficients (at low
temperatures) of Desolite and Hot Melt and Borden have been taken from
Refs. 9 and 8, respectively. Values of E at 0 and -40°C for silicone
and nylon are taken from Ref. 5; the value of α for silicone comes from
Ref. 10 and for nylon from Ref. 6. These all appear in Table I.

The Poisson ratios of the coating materials are estimated by assuming
that the bulk modulus,

$$ K = \frac{E}{3(1 - 2\nu)}, \tag{1} $$

is independent of temperature. Studies (Ref. 11) have shown that K
increases only by about a factor of 2 (though E increases several
decades) as the material goes from a rubbery to a glassy state. Taking
ν = 0.25 at the low-temperature glassy plateau (where E = E_g), ν is
determined at any temperature by

$$ \nu = 0.50 - 0.25\,E/E_g, \tag{2} $$

derived from the constancy of K.

The values of E_g are taken from the 1- or 2-second modulus at the
lowest temperature measured (typically -60°C). For primary materials
where a plateau value was not attained or was unavailable, the E_g for
Desolite was used to estimate ν. Available values of E_g and calculated
values of ν appear in Table I.

* Registered trademark of De Soto, Inc.

Fig. 2. Isochronal plot (24 hours) for the Young's modulus of Desolite
versus temperature.

Fig. 3. Isochronal plot (24 hours) for the shear modulus of Hot Melt
versus temperature.

Fig. 4. Isochronal plot (24 hours) for the Young's modulus of Borden
versus temperature.


2.2 Thermal strain 


The different thermal expansion coefficients can be consolidated
into an effective expansion coefficient α_eff for the coated fiber (or
wire) as a whole. The rule of mixtures (Ref. 8) approximates α_eff by
weighting the various expansion coefficients by the cross-sectional
area and the modulus of the corresponding materials. For N materials
the formula is

$$ \alpha_{\mathrm{eff}} = \sum_{n=1}^{N} \alpha_n A_n E_n \Big/ \sum_{n=1}^{N} A_n E_n, \tag{3} $$

where α_n denotes the expansion coefficient, A_n the area, and E_n the
modulus of the nth material.
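
The rule of mixtures can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the example assumes the wire, silicone, and nylon values as read from Table I at -40°C together with the Katsuyama dimensions quoted in Section III (radii of 75, 175, and 600 µm).

```python
import math


def annulus_area(r_in, r_out):
    """Cross-sectional area of a layer between radii r_in and r_out."""
    return math.pi * (r_out**2 - r_in**2)


def rule_of_mixtures(layers):
    """Effective expansion coefficient, eq. (3).

    layers is a list of (alpha, area, modulus) tuples, one per material.
    """
    numerator = sum(alpha * area * modulus for alpha, area, modulus in layers)
    denominator = sum(area * modulus for _, area, modulus in layers)
    return numerator / denominator


# Assumed inputs: Table I values at -40 C (psi, 1/C), radii in micrometers.
katsuyama_layers = [
    (1.6e-5, annulus_area(0.0, 75.0), 2.35e7),     # wire
    (3.0e-4, annulus_area(75.0, 175.0), 2.64e2),   # silicone primary
    (1.0e-4, annulus_area(175.0, 600.0), 3.08e5),  # nylon secondary
]
alpha_eff = rule_of_mixtures(katsuyama_layers)
print(f"alpha_eff (rule of mixtures) = {alpha_eff:.3e} per deg C")
```

Under these assumptions the estimate comes out near 5.25E-5 per °C, which can be compared with the numerically calculated values reported for the same structure in Section III.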

The rule of mixtures is exact when the coupling of radial displace- 
ments is neglected, as if the fiber and coating layers were parallel 
springs joined at the ends. An analysis accounting for radial coupling 
is described in Appendix A. Deviations from the rule of mixtures are 
usually within a few percent, but examples have been found with
deviations from -15 to +386 percent.

The thermal strain of the fiber for a temperature change T_0 to T_1 is

$$ \varepsilon_{\mathrm{therm}} = \int_{T_0}^{T_1} (\alpha_{\mathrm{eff}} - \alpha_{\mathrm{fib}})\, dT, \tag{4} $$

where α_fib denotes the thermal expansion coefficient of the fiber by
itself. In general, α_eff will depend on time and temperature through its
dependence on E and α of the coating materials. With this information,
the strain of the fiber can be determined as a function of time for any
temperature cycling as in Ref. 8.

For simplicity, the α_eff will be taken here as the effective expansion
coefficient averaged over the range of temperature change. As for α_fib,
it is independent of temperature; so the thermal strain of the fiber for
a temperature change T is simply

$$ \varepsilon_{\mathrm{therm}} = (\alpha_{\mathrm{eff}} - \alpha_{\mathrm{fib}})\, T. \tag{5} $$

The value of α_eff will be estimated by using 24-hour modulus values at
-40°C and, for comparison, 0°C.
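
As a quick arithmetic check of eq. (5), the sketch below evaluates the thermal strain for a 100°C temperature drop. The function name is hypothetical; the α_eff inputs are assumed to be the -40°C values listed in Table II for the two Yabuta designs, and α_fib is the glass value from Table I.

```python
def thermal_strain(alpha_eff, alpha_fib, delta_t):
    """Thermal strain of the fiber, eq. (5), for a temperature change delta_t."""
    return (alpha_eff - alpha_fib) * delta_t


ALPHA_GLASS = 5.04e-7  # per deg C, glass fiber (Table I)
DELTA_T = 100.0        # deg C, temperature drop assumed in Section III

# Assumed alpha_eff inputs: Table II, -40 C material parameters.
strain_thick_primary = thermal_strain(6.146e-5, ALPHA_GLASS, DELTA_T)
strain_thin_primary = thermal_strain(5.898e-5, ALPHA_GLASS, DELTA_T)

print(f"thick primary: {100 * strain_thick_primary:.3f} percent")
print(f"thin primary:  {100 * strain_thin_primary:.3f} percent")
```

The two results should agree with the ε_therm column of Table II (0.610 and 0.585 percent) to the precision shown there.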


2.3 Buckling analysis 


The fiber (or wire) may buckle and follow a wavelike path because
of compressive strain. The theory of elastic stability (Ref. 12) is used
to study the buckling of fibers within their coating.

The fiber is treated as a beam in an elastic medium. A force and
moment balance yields the differential equation

$$ E_f I \frac{d^4 y}{dz^4} + F \frac{d^2 y}{dz^2} + \kappa y = 0 \tag{6} $$

for the deflection y of the fiber as a function of the distance z along
the fiber. Small deflections are assumed. The parameters E_f, I, and F
denote the modulus of the fiber, the moment of inertia of the fiber
(π r_f^4/4 for radius r_f), and the compressive force on the fiber,
respectively. The parameter κ denotes the spring constant of the fiber,
which is the ratio of the centering force exerted by the coating to the
displacement of the fiber from center.

Early work incorrectly assumed that κ = E_p (Refs. 5 and 6), but
Vangheluwe (Ref. 13), assuming a rigid secondary, determined that

$$ \kappa = \frac{4\pi E_p (1 - \nu_p)(3 - 4\nu_p)}{(1 + \nu_p)\left[(3 - 4\nu_p)^2 \ln(r_p/r_f) - \dfrac{(r_p/r_f)^2 - 1}{(r_p/r_f)^2 + 1}\right]}, \tag{7} $$

where subscript p signifies the primary region and f the fiber (or wire).
A numerical calculation of κ, which accounts for the elasticity of the
secondary, is described in Appendix B. The two calculations give
identical results when the secondary is assumed rigid. Results indicate
that the elasticity of the secondary can reduce κ by more than 80
percent.
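
The rigid-secondary formula of eq. (7) is easy to evaluate directly. The sketch below reads it as κ = 4πE_p(1 - ν_p)(3 - 4ν_p) / {(1 + ν_p)[(3 - 4ν_p)^2 ln X - (X^2 - 1)/(X^2 + 1)]} with X = r_p/r_f; the function name is hypothetical, and the inputs are assumed to be the silicone values of Table I at -40°C with the Katsuyama radii of Section III.

```python
import math


def kappa_rigid_secondary(E_p, nu_p, r_p, r_f):
    """Spring constant of eq. (7), valid for a rigid secondary.

    E_p and the result share the same units (psi here); r_p and r_f
    only need to be in the same length unit.
    """
    X = r_p / r_f
    u = (X**2 - 1.0) / (X**2 + 1.0)
    numerator = 4.0 * math.pi * E_p * (1.0 - nu_p) * (3.0 - 4.0 * nu_p)
    denominator = (1.0 + nu_p) * ((3.0 - 4.0 * nu_p) ** 2 * math.log(X) - u)
    return numerator / denominator


# Assumed inputs: silicone primary at -40 C on the 75-um-radius wire.
kappa = kappa_rigid_secondary(E_p=2.64e2, nu_p=0.49986, r_p=175.0, r_f=75.0)
print(f"kappa (rigid secondary) = {kappa:.0f} psi")
```

With these inputs the rigid-secondary estimate is roughly 7E3 psi. A compliant secondary can only lower this figure, which is why the numerical method of Appendix B is used for the applications.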
The buckling solution has the form

$$ y = A \sin\left(\frac{2\pi z}{P}\right), \tag{8} $$

where A is an arbitrary amplitude and P is the pitch. (This form in
conjunction with

$$ x = A \cos\left(\frac{2\pi z}{P}\right) \tag{9} $$

for the orthogonal component covers the case of helical buckling.)
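Substituting y into eq. (6) gives

$$ A\left[E_f I\left(\frac{2\pi}{P}\right)^4 - F\left(\frac{2\pi}{P}\right)^2 + \kappa\right]\sin\left(\frac{2\pi z}{P}\right) = 0. \tag{10} $$

Hence, the force required for the fiber to buckle with pitch P is

$$ F = E_f I\left(\frac{2\pi}{P}\right)^2 + \kappa\left(\frac{P}{2\pi}\right)^2. \tag{11} $$

The minimum buckling force is

$$ F_{\min} = 2\sqrt{E_f I \kappa} = r_f^2\sqrt{\pi E_f \kappa}, \tag{12} $$

corresponding to a pitch of

$$ P_{\min} = 2\pi\left(E_f I/\kappa\right)^{1/4} = \pi r_f\left(4\pi E_f/\kappa\right)^{1/4}. \tag{13} $$

The corresponding strain of the fiber is

$$ \varepsilon_{\min} = \frac{F_{\min}}{\pi r_f^2 E_f} = \sqrt{\frac{\kappa}{\pi E_f}}. \tag{14} $$

This formula shows that fibers having smaller spring constants, which
are associated with softer and/or thicker primaries, require less strain
to buckle.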
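
The relations of eqs. (11) through (14) can be checked numerically. The following is a minimal sketch with hypothetical function names; the example inputs are assumed to be the Katsuyama wire values used in Section III (E_f = 2.35E7 psi, r_f = 75 µm, and κ = 6899 psi from the -40°C parameters).

```python
import math


def buckling_force(E_f, r_f, kappa, pitch):
    """Force needed to buckle with a given pitch, eq. (11)."""
    inertia = math.pi * r_f**4 / 4.0
    q = 2.0 * math.pi / pitch
    return E_f * inertia * q**2 + kappa / q**2


def min_buckling(E_f, r_f, kappa):
    """Minimum buckling force, its pitch, and the strain, eqs. (12)-(14)."""
    inertia = math.pi * r_f**4 / 4.0
    force_min = 2.0 * math.sqrt(E_f * inertia * kappa)
    pitch_min = 2.0 * math.pi * (E_f * inertia / kappa) ** 0.25
    strain_min = math.sqrt(kappa / (math.pi * E_f))
    return force_min, pitch_min, strain_min


# Assumed inputs: moduli in psi, radius in mm, so the pitch is in mm
# and the force is in mixed units of psi * mm^2.
E_WIRE, R_WIRE, KAPPA = 2.35e7, 0.075, 6899.0
force_min, pitch_min, strain_min = min_buckling(E_WIRE, R_WIRE, KAPPA)
print(f"eps_min = {100 * strain_min:.3f} percent, P_min = {pitch_min:.2f} mm")
```

For these inputs the strain and pitch should match the first row of Table II (0.967 percent and 3.39 mm), and evaluating `buckling_force` at any other pitch gives a larger force than `force_min`.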

If ε_therm > ε_min, then the fiber will buckle in its coating. If
ε_therm < ε_min, then buckling can still occur if ε_therm + ε_res > ε_min,
where ε_res denotes residual strain caused by initial bending or other
moments. Even if ε_therm + ε_res < ε_min, thermal bending can occur. In
thermal bending, initial bending or other moments of the fiber are
accentuated by the thermal stress. The size of the effect grows as
(ε_min - ε_therm - ε_res)^-1, as shown in Ref. 12.


III. APPLICATIONS

In this section the buckling analysis is applied to the dual-coated 
wire of Katsuyama, for which buckling was observed, and to various 
dual-coated fiber designs. Parameter studies are also presented. 
3.1 Wire of Katsuyama and fibers of Yabuta 


The wire of Katsuyama et al. (Ref. 5) had a dual coating where the
primary/secondary was composed of silicone/nylon. The material
parameters for these are given in Table I. The wire radius was r_f =
75 µm. The outer radii of the primary (r_p) and the secondary (r_s) were
r_p/r_s = 175 µm/600 µm.

1572 TECHNICAL JOURNAL, SEPTEMBER 1985

The calculated spring constant is κ = 6.9E3 psi, and the effective
thermal expansion coefficient is α_eff = 5.367E-5 °C^-1. If we assume a
temperature drop of 100°C, the strain on the wire from eq. (5) is
ε_therm = 0.38 percent, and the minimum buckling strain from eq. (14) is
ε_min = 0.97 percent with a pitch of P_min = 3.39 mm. These values were
obtained using the material parameters for -40°C. For 0°C, where the
coating materials are somewhat softer, the calculations give κ = 3.87E3
psi and α_eff = 5.25E-5 °C^-1; the same temperature change of 100°C gives
ε_therm = 0.37 percent and ε_min = 0.72 percent with a pitch of 3.92 mm.
These calculated values are summarized in Table II.

As buckling did occur in the Katsuyama wire at -70°C, the difference
ε_min - ε_therm estimates the residual strain ε_res in the wire at 30°C.
Relative to the -40°C parameters, ε_res ≈ 0.59 percent; relative to the
0°C parameters, ε_res ≈ 0.35 percent.

Yabuta et al. (see Fig. 7 of Ref. 6) studied the silicone/nylon coating
system on optical fibers. One design having a relatively thick primary
had substantial added loss at -50°C; its dimensions were r_p/r_s = 250
µm/471 µm. Another design with a thinner primary had negligible
added loss even though the secondary had somewhat more cross-sectional
area; its dimensions were 100 µm/448 µm. The fiber radius
for both was r_f = 62.5 µm.

Calculated values for κ, α_eff, ε_therm, ε_min, and P_min are given in
Table II for both designs using both 0 and -40°C material parameters
from Table I. For the lower-temperature parameters, the thick primary
gives ε_therm = 0.61 percent and ε_min = 0.81 percent with a pitch of 3.09
mm, and the thin primary gives ε_therm = 0.59 percent and ε_min = 3.10
percent with a pitch of 1.57 mm. These values indicate that buckling
in the first case is at least as probable as for the Katsuyama wire, but
much less probable in the second.

Table II. Buckling quantities for certain silicone coatings

          κ        α_eff       ε_therm   ε_min    P_min
Temp.     (psi)    (°C^-1)     (%)       (%)      (mm)

(a) Katsuyama et al.
-40°C     6899     5.367E-5    0.377     0.967    3.39
0°C       3868     5.253E-5    0.365     0.724    3.92

(b) Yabuta et al., poor
-40°C     2184     6.146E-5    0.610     0.806    3.09
0°C       1219     5.969E-5    0.592     0.602    3.57

(c) Yabuta et al., good
-40°C     32393    5.898E-5    0.585     3.104    1.57
0°C       18571    5.772E-5    0.572     2.350    1.80


3.2 S5 experiment 


Four dual-coat designs were selected from the S5 experiment (Ref. 7).
Their performance, based on added loss at low temperatures, ranged from
good to bad. Figure 5 shows the added loss versus temperature for the
best and worst cases.

Three of the four designs used UV-cured Desolite for the primary;
the other used Hot Melt. All used Borden for the secondary. The
coating dimensions varied in primary outer diameter/secondary outer
diameter from 8/13 to 11/15, expressed in mils. The designs are
specified in Table III, together with calculated values of κ, α_eff,
ε_therm (for a 100°C temperature drop), and ε_min and P_min using the
material parameters for -40 and 0°C.

Fig. 5. Added loss at λ ≈ 1.3 µm versus temperature for the (a) best
coatings from the S5 experiment and (b) worst coatings from the S5
experiment. (Vertical axis: added loss in dB/km at wavelengths of 1.29
to 1.33 microns; horizontal axis: temperature in degrees Fahrenheit.)

Table III. S5 experiment

Size     Temp.   Coat                κ        α_eff       ε_therm   ε_min    P_min
(mils)   (°C)    (P)        Rank     (kpsi)   (°C^-1)     (%)       (%)      (mm)
8/13     -40     UV         Best     570.6    0.112E-4    0.107     13.03    0.76
         0                           28.6     0.487E-5    0.044     2.92     1.61
10/13    -40     UV         Second   288.8    0.855E-5    0.081     9.27     0.90
         0                           10.3     0.349E-5    0.030     1.75     2.07
11/15    -40     UV         Third    218.1    0.120E-4    0.115     8.05     0.97
         0                           7.47     0.494E-5    0.044     1.49     2.18
11/15    -40     Hot Melt   Worst    10.1     0.138E-4    0.133     1.73     2.08
         0                           2.78     0.511E-5    0.046     0.91     2.88

The calculated difference ε_min - ε_therm tracks the performance rank
of the design for both 0 and -40°C. The design with the stiffest and
thinnest primary had the least added loss; the one with the softest,
thickest primary had the most.


3.3 Parameter studies 


The thermal and buckling strains depend on the geometry and
materials of the coating. This dependence is now studied in general
terms.

The thermal strain ε_therm is proportional to the quantity α_eff - α_fib.
By the rule of mixtures,

$$ \alpha_{\mathrm{eff}} - \alpha_{\mathrm{fib}} = \frac{A_p E_p(\alpha_p - \alpha_{\mathrm{fib}}) + A_s E_s(\alpha_s - \alpha_{\mathrm{fib}})}{A_f E_f + A_p E_p + A_s E_s}. \tag{15} $$

Usually, both the area A_p and modulus E_p of the primary are much
smaller than A_s and E_s of the secondary. Neglecting terms with A_p E_p
gives

$$ \alpha_{\mathrm{eff}} - \alpha_{\mathrm{fib}} = \frac{A_s E_s(\alpha_s - \alpha_{\mathrm{fib}})}{A_f E_f + A_s E_s} = \frac{\alpha_s - \alpha_{\mathrm{fib}}}{1 + (A_f E_f/A_s E_s)} \tag{16} $$

or

$$ \alpha_{\mathrm{eff}} - \alpha_{\mathrm{fib}} = (\alpha_s - \alpha_{\mathrm{fib}}) \Big/ \left[1 + \frac{E_f/E_s}{(r_s/r_f)^2 - (r_p/r_f)^2}\right]. \tag{17} $$

Figure 6 shows plots of this expression versus r_s/r_f, assuming E_f/E_s
= 20 (as for Borden at -40°C) and E_f/E_s = 50 (as for Borden at 0°C)
and also r_p/r_f = 1.5. The plots again show the well-known fact that
larger, stiffer secondaries produce more thermal strain than smaller,
softer ones.

Fig. 6. Effective thermal expansion coefficient versus secondary radius
for two secondary modulus values according to the rule of mixtures.

The buckling strain ε_min from eq. (14) is proportional to √κ and
inversely proportional to √E_f. The latter implies that wire filaments,
having a higher modulus, require less strain to buckle than glass fibers.
The spring constant κ will be studied first for a rigid secondary to
which the formula of Vangheluwe in eq. (7) applies. The elasticity of
the secondary will be considered subsequently.
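
The simplified thermal-strain expression of eq. (17), in which the primary is neglected, can be sketched as follows. The function name is hypothetical, and the example assumes the Borden expansion coefficient (0.6E-4 per °C) and the glass value from Table I, with the modulus ratios of 20 and 50 and r_p/r_f = 1.5 quoted for Fig. 6.

```python
def alpha_excess(alpha_s, alpha_fib, ef_over_es, rs_over_rf, rp_over_rf):
    """alpha_eff - alpha_fib from eq. (17), neglecting the primary layer."""
    area_ratio = rs_over_rf**2 - rp_over_rf**2  # A_s / A_f
    return (alpha_s - alpha_fib) / (1.0 + ef_over_es / area_ratio)


ALPHA_BORDEN = 0.6e-4  # per deg C (Table I)
ALPHA_GLASS = 5.04e-7  # per deg C (Table I)

# Sweep the secondary radius for the two modulus ratios used in Fig. 6.
sweep = {}
for ratio in (20.0, 50.0):
    sweep[ratio] = [
        alpha_excess(ALPHA_BORDEN, ALPHA_GLASS, ratio, rs, 1.5)
        for rs in (2.0, 3.0, 4.0)
    ]
    print(ratio, [f"{value:.2e}" for value in sweep[ratio]])
```

The values grow with r_s/r_f and are larger for the smaller modulus ratio, which is the trend described for Fig. 6: larger, stiffer secondaries put more thermal strain on the fiber.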

If X = r_p/r_f is close to 1, the approximation (Ref. 14)

$$ \ln X \approx \frac{X^2 - 1}{X^2 + 1} + \frac{1}{3}\left(\frac{X^2 - 1}{X^2 + 1}\right)^3 \tag{18} $$

gives (for ν = ν_p)

$$ \kappa \approx \frac{4\pi E_p (1 - \nu)(3 - 4\nu)}{(1 + \nu)\left[8(1 - \nu)(1 - 2\nu)\dfrac{X^2 - 1}{X^2 + 1} + \dfrac{(3 - 4\nu)^2}{3}\left(\dfrac{X^2 - 1}{X^2 + 1}\right)^3\right]}. \tag{19} $$

When ν = 1/2, eq. (19) reduces to

$$ \kappa \approx \frac{4\pi E_p (X^2 + 1)^3}{(X^2 - 1)^3}; \tag{20} $$

when ν is not close to 1/2 and X ≈ 1, eq. (19) reduces to

$$ \kappa \approx \frac{\pi E_p}{2(1 - 2\nu)}\left(\frac{3 - 4\nu}{1 + \nu}\right)\frac{X^2 + 1}{X^2 - 1} = \frac{3\pi K_p}{2}\left(\frac{3 - 4\nu}{1 + \nu}\right)\frac{X^2 + 1}{X^2 - 1}, \tag{21} $$

where K_p denotes the bulk modulus of the primary. In all cases, κ
increases without bound as X approaches 1 (i.e., as the primary
becomes thinner). If X is large, the logarithmic term in the
denominator dominates to give

$$ \kappa \approx \frac{4\pi E_p (1 - \nu)}{(1 + \nu)(3 - 4\nu)\ln(r_p/r_f)}. \tag{22} $$

Thus, κ goes to 0 as r_p/r_f becomes large.

Figure 7 shows κ/E_p plotted versus r_p/r_f for ν = 0.40, 0.49, and 0.499.
The curves illustrate the asymptotic behavior of κ given above. They
also indicate that thicker primaries have smaller κ with greatest
sensitivity when ν is close to 1/2. The compliance of the secondary
leads to lower κ.

Fig. 7. Spring constant versus primary radius for three Poisson ratios
assuming a rigid secondary.

Figure 8 shows κ versus r_s/r_f for E_s = 500, 200, and 100 kpsi with
corresponding ν_s = 0.3, 0.42, and 0.46, respectively (chosen to keep the
bulk modulus fixed). The primary was assumed to be Desolite at
-40°C with r_p/r_f = 1.5. The curves show that κ is smaller for thicker,
more compliant secondaries. The compliance of the secondary can
cause κ to drop to 20 percent of the value from the formula of
Vangheluwe.
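
The limiting forms can be compared with the full rigid-secondary formula numerically. The sketch below is only illustrative and rests on the reading of eqs. (7), (20), and (22) given in its comments; the function names are hypothetical and E_p is set to 1 so that the results are values of κ/E_p.

```python
import math


def kappa_full(E_p, nu, X):
    """Eq. (7): 4*pi*E_p*(1-nu)*(3-4nu) / ((1+nu)*[(3-4nu)^2 ln X - u]),
    with u = (X^2 - 1)/(X^2 + 1)."""
    u = (X**2 - 1.0) / (X**2 + 1.0)
    bracket = (3.0 - 4.0 * nu) ** 2 * math.log(X) - u
    return 4.0 * math.pi * E_p * (1.0 - nu) * (3.0 - 4.0 * nu) / ((1.0 + nu) * bracket)


def kappa_thin_incompressible(E_p, X):
    """Eq. (20): thin primary with nu = 1/2."""
    u = (X**2 - 1.0) / (X**2 + 1.0)
    return 4.0 * math.pi * E_p / u**3


def kappa_thick(E_p, nu, X):
    """Eq. (22): thick primary, logarithmic term dominant."""
    return 4.0 * math.pi * E_p * (1.0 - nu) / ((1.0 + nu) * (3.0 - 4.0 * nu) * math.log(X))


ratio_thin = kappa_thin_incompressible(1.0, 1.1) / kappa_full(1.0, 0.5, 1.1)
ratio_thick_10 = kappa_thick(1.0, 0.4, 10.0) / kappa_full(1.0, 0.4, 10.0)
ratio_thick_1e6 = kappa_thick(1.0, 0.4, 1.0e6) / kappa_full(1.0, 0.4, 1.0e6)
print(ratio_thin, ratio_thick_10, ratio_thick_1e6)
```

The thin-primary limit is already close at X = 1.1, whereas the thick-primary limit is approached only logarithmically, so eq. (22) is best regarded as a statement of trend rather than a working approximation at practical radius ratios.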


IV. SUMMARY AND CONCLUSIONS 


This paper has concerned buckling of dual-coated optical fiber
caused by compressive stress on the fiber exerted by the coating at
low temperatures. The stress arises because the thermal expansion
coefficient of the coating is substantially larger than that of the fiber.

Fig. 8. Spring constant versus secondary radius for three Poisson ratios
of the secondary assuming a Desolite primary at -40°C with r_p/r_f = 1.5.

Buckling is one of two mechanisms used to explain microbending of 
the fiber axis, the prime source of added optical loss in the fiber. In 
the other mechanism, lateral pressure imprints irregularities around the
fiber onto the path of the fiber. This occurs at room, as well as low, 
temperatures and in single-coated, as well as dual-coated, fiber. The 
antidote to lateral pressure is dual coating where the inside primary 
coating buffers the fiber and decouples it from the outside. However, 
too much buffering has been found to produce more added loss at low 
temperatures, presumably due to buckling. 

Two basic ways exist for preventing buckling. The thermal strain 
on the fiber might be decreased, or the strain needed to buckle might 
be increased. 

Thermal strain depends mostly on the secondary layer. Smaller 
secondaries put less strain on the fiber; secondaries of a more com- 
pliant material or ones with a smaller thermal expansion coefficient 
have the same effect. These changes are compatible with the dictates 
of the simple lateral-pressure model. 

The buckling strain depends on the spring constant κ. Thinner
primaries and, to a lesser extent, thinner secondaries have larger
buckling strains. Stiffer primary materials and, to a lesser extent,
stiffer secondary materials provide the same effect. The magnitude of
these effects is indicated in Fig. 7 for the primary and in Fig. 8 for the
secondary. In general, reduced buffering provides more resistance to
buckling, just the opposite of what the simple lateral-pressure model
dictates.

Experimental results indicate that buckling does occur. In the
silicone/nylon coatings of Yabuta et al. (Ref. 6), thicker primaries were
associated with substantially more added loss at low temperatures.
Calculations in Section III showed that the coating with thin primary and
low added loss at -40°C needs about four times as much strain to
buckle as the one with thick primary and high added loss at -40°C.
The primary material (silicone) remains relatively compliant (264 psi)
at this temperature. The experimental results are consistent with the
buckling mechanism, but not with the lateral pressure mechanism by
itself. Similar results were found for fibers in the S5 experiment
(Ref. 7). Thick primaries of the compliant Hot Melt material performed
poorly compared with thinner primaries of the De Soto material
(Desolite), which stiffens at low temperatures. This outcome may be
explained by the buckling calculations in Section III, which indicate
that the spring constant for the latter coating is 50 times greater than
for the former.

Nevertheless, the calculated thermal strain was less than the buck- 
ling strain in all cases, even for the wire of Katsuyama et al. where 
buckling was known to have occurred. The shortfall may be explained
by various factors outside the model, as follows: 

1. Initial bending (e.g., stranding) involves a strain on the fiber, 
which adds to the thermal strain to bring the total closer to the 
buckling strain. 

2. The thermal strain depends on the temperature drop, which was 
taken as 100°C. The precise temperature drop should be measured 
from a reference temperature where the residual stress on the fiber is 
0. Because this temperature is uncertain, the temperature drop is also
uncertain and might be more than 100°C.

3. Asymmetry of the coating or eccentricity of the fiber in its coating 
in conjunction with thermal stress produces bending moments along 
the fiber because, at equilibrium, moments on the coating must be 
balanced by moments on the fiber. These deformations can arise from 
imperfections in the coating process or lateral pressure in the cable. 
Thus, irregularities associated with lateral pressure can be transmitted 
to the fiber despite the buffering protection by the primary layer. 

4. Thermal bending, a precursor to buckling, would produce a 
steadily increasing added optical loss as temperature drops for strains 
below the buckling strain. 

Other omissions include details of the viscoelastic nature of the 
coating materials, the thermal dependence of the expansion coeffi- 
cients, the thermal cycling, and thermal gradients. The buckling 
calculations must be regarded as estimates most valuable in making 
comparisons.

APPENDIX A
Calculation of Thermal Stress

This appendix gives a method for calculating the effective expansion
coefficient of a dual-coated fiber. The calculation goes beyond the rule
of mixtures by treating the coupling of the radial displacements of the
various layers.

Define cylindrical coordinates (r, θ, z) based at the center of the
coated fiber. The displacement in each of the three regions can be
represented by

$$ U = \begin{pmatrix} U_r \\ U_\theta \\ U_z \end{pmatrix} = \begin{pmatrix} a_n/r + b_n r \\ 0 \\ C z \end{pmatrix}, \qquad n = 1, 2, 3. \tag{23} $$

The fiber has n = 1, the primary coating n = 2, and the secondary
n = 3. This vector function satisfies the Cauchy Navier equation
(Ref. 15) for displacements and represents a solution without warping
(because C is independent of n) or angular deformation.

The strain components are

$$ \begin{aligned}
\varepsilon_{rr} &= \frac{\partial U_r}{\partial r} = -\frac{a_n}{r^2} + b_n, \\
\varepsilon_{\theta\theta} &= \frac{1}{r}\frac{\partial U_\theta}{\partial \theta} + \frac{U_r}{r} = \frac{a_n}{r^2} + b_n, \\
\varepsilon_{zz} &= \frac{\partial U_z}{\partial z} = C, \\
\varepsilon_{\theta z} &= \varepsilon_{\theta r} = \varepsilon_{rz} = 0.
\end{aligned} \tag{24} $$

The first invariant of the strain tensor is

$$ e = \varepsilon_{rr} + \varepsilon_{\theta\theta} + \varepsilon_{zz} = 2b_n + C. \tag{25} $$

Stress components are obtained from the generalized Hooke's law,

$$ \sigma_{ij} = \lambda e\,\delta_{ij} + 2G\varepsilon_{ij} - \beta T\delta_{ij}, \tag{26} $$

which gives

$$ \begin{aligned}
\sigma_{rr} &= \lambda e + 2G\varepsilon_{rr} - \beta T, \\
\sigma_{zz} &= \lambda e + 2G\varepsilon_{zz} - \beta T,
\end{aligned} \tag{27} $$

where

$$ G = \frac{E}{2(1 + \nu)}, \qquad \beta = \frac{E\alpha}{1 - 2\nu}, \qquad \lambda = \frac{2G\nu}{1 - 2\nu}, \tag{28} $$

α is the thermal expansion coefficient, and T is the temperature
change. The material parameters are assumed to depend on the region
or layer, but they are assumed constant within each layer.

The unknown coefficients used to describe the displacement U are
determined from the various conditions. The displacement must be
bounded, so a_1 = 0; and it must be continuous, so

$$ a_2 = (b_1 - b_2) r_1^2 \quad \text{and} \quad a_3 = a_2 + (b_2 - b_3) r_2^2, \tag{29} $$

where r_1, r_2, and r_3 denote the radius of the fiber, the primary, and
the secondary, respectively. The radial stress component σ_rr must be
continuous, so

$$ \begin{aligned}
b_1(2\lambda_1 + 2G_1) + C\lambda_1 - \beta_1 T &= b_2(2\lambda_2 + 2G_2) - a_2\,2G_2/r_1^2 + C\lambda_2 - \beta_2 T, \\
b_2(2\lambda_2 + 2G_2) - a_2\,2G_2/r_2^2 + C\lambda_2 - \beta_2 T &= b_3(2\lambda_3 + 2G_3) - a_3\,2G_3/r_2^2 + C\lambda_3 - \beta_3 T, \\
b_3(2\lambda_3 + 2G_3) - a_3\,2G_3/r_3^2 + C\lambda_3 - \beta_3 T &= 0.
\end{aligned} \tag{30} $$

Finally, the total longitudinal force or integrated stress must be 0;
hence,

$$ \pi\left[r_1^2\,\sigma_{zz}^{(1)} + (r_2^2 - r_1^2)\,\sigma_{zz}^{(2)} + (r_3^2 - r_2^2)\,\sigma_{zz}^{(3)}\right] = 0 \tag{31} $$

or

$$ \begin{aligned}
& r_1^2\left[b_1 2\lambda_1 + C(\lambda_1 + 2G_1) - \beta_1 T\right] \\
&\quad + (r_2^2 - r_1^2)\left[b_2 2\lambda_2 + C(\lambda_2 + 2G_2) - \beta_2 T\right] \\
&\quad + (r_3^2 - r_2^2)\left[b_3 2\lambda_3 + C(\lambda_3 + 2G_3) - \beta_3 T\right] = 0.
\end{aligned} \tag{32} $$

As for the moments, they are automatically 0 when the coatings are
concentric.

These six linear equations in the six unknowns (a_2, a_3, b_1, b_2, b_3, C)
can be solved by standard numerical methods. The object of the
calculation is the coefficient C, which denotes the thermal strain of
the structure. When T is set to 1, then C is α_eff, the effective
expansion coefficient.
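
The six-equation system can be assembled and solved with a small dense solver. The sketch below is one possible arrangement, not the author's program: the function names and the unknown ordering (b_1, a_2, b_2, a_3, b_3, C) are choices made here, and the equations are written out as stated in the comments, with T = 1, β = Eα/(1 - 2ν), and radii normalized by the fiber radius.

```python
def lame(E, nu, alpha):
    """Return (lambda, G, beta) for one layer."""
    G = E / (2.0 * (1.0 + nu))
    lam = 2.0 * G * nu / (1.0 - 2.0 * nu)
    beta = E * alpha / (1.0 - 2.0 * nu)
    return lam, G, beta


def solve(A, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda i: abs(M[i][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for i in range(col + 1, n):
            factor = M[i][col] / M[col][col]
            for j in range(col, n + 1):
                M[i][j] -= factor * M[col][j]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x


def alpha_eff_coupled(radii, layers):
    """Effective expansion coefficient with radial coupling (T = 1).

    radii = (r1, r2, r3); layers = [(E, nu, alpha), ...], fiber first.
    """
    r1, r2, r3 = (r / radii[0] for r in radii)  # normalized radii
    (l1, G1, B1), (l2, G2, B2), (l3, G3, B3) = (lame(*p) for p in layers)
    A1, A2, A3 = r1**2, r2**2 - r1**2, r3**2 - r2**2  # areas / pi
    A = [
        # displacement continuity at r1 and r2
        [r1, -1.0 / r1, -r1, 0.0, 0.0, 0.0],
        [0.0, 1.0 / r2, r2, -1.0 / r2, -r2, 0.0],
        # radial stress continuity at r1 and r2, zero traction at r3
        [2 * (l1 + G1), 2 * G2 / r1**2, -2 * (l2 + G2), 0.0, 0.0, l1 - l2],
        [0.0, -2 * G2 / r2**2, 2 * (l2 + G2), 2 * G3 / r2**2, -2 * (l3 + G3), l2 - l3],
        [0.0, 0.0, 0.0, -2 * G3 / r3**2, 2 * (l3 + G3), l3],
        # zero net longitudinal force
        [2 * l1 * A1, 0.0, 2 * l2 * A2, 0.0, 2 * l3 * A3,
         (l1 + 2 * G1) * A1 + (l2 + 2 * G2) * A2 + (l3 + 2 * G3) * A3],
    ]
    rhs = [0.0, 0.0, B1 - B2, B2 - B3, B3, B1 * A1 + B2 * A2 + B3 * A3]
    return solve(A, rhs)[5]


# Assumed inputs: Katsuyama wire with Table I values at -40 C.
katsuyama = alpha_eff_coupled(
    (75.0, 175.0, 600.0),
    [(2.35e7, 0.25, 1.6e-5), (2.64e2, 0.49986, 3.0e-4), (3.08e5, 0.333, 1.0e-4)],
)
print(f"alpha_eff with radial coupling = {katsuyama:.3e} per deg C")
```

Two limiting cases give a simple check on any such implementation: with all Poisson ratios set to 0 the radial coupling vanishes and the result must equal the rule of mixtures, and with three identical layers the result must equal the common expansion coefficient.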


APPENDIX B 
Calculation of Spring Constant 

This appendix gives a method for calculating the spring constant x 
of a fiber within a dual coating. The calculation goes beyond the 
formula of Vangheluwe’? by accounting for the elasticity of the outer 
secondary coating. 

Define cylindrical coordinates (r, 6, z) based at the fiber center. The 
displacement function 

_{U; 


must satisfy the Cauchy Navier equation’® 

1 
~ Poy 
The outer surface of the secondary is assumed fixed, and when the 
rigid fiber is translated 6 in the x direction, 


V?U + CV(V-U) =0 C 





(33) 
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6a, = 5(cosé a, — sin@ ag), (34) 


the displacement at r = r, for small 6 is 


_ {| 6cos 6 
eS & sin AY (35) 
This means that the angular dependence, in general, is 
_ { U,(r) cos 6 
U= tae sin a (36) 


The four solutions of the Cauchy Navier eq. (33) having this angular 
dependence are 


t=" 3?) (828) (ADE 
— ’ 277) _ ? —2]? 

U,(r) Shah (2 + 3C)r 1 r 

as can be checked by direct substitution. The general solution is a 
linear combination of these four with four unknown coefficients for 


each of the two coating layers. 
The total of eight unknown coefficients are determined by eight 


conditions. At ri, 
Oe\ = i 
(ci) -(4) 
U,\ _ [0\. 
U;,} \o/’ 
U, 
Us 


are continuous across the interface. These give six equations. The 
other two conditions involve the normal stress components o,, and 
7,0, which must be continuous at re. In terms of U these are 


at rs, 


and at ro, 


r r r+ 
wy ote Oa 
or or r 
~+ 
peg gee (38) 


or r 


where \ = 2uvC and the angular dependence of eq. (36) has been used. 
The eight linear equations in eight unknowns can be solved by stand- 
ard numerical methods. 
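The force on the fiber for the deflection $\delta$ is

$$F = \int_{\text{Fiber}} \boldsymbol{\sigma}\cdot\mathbf{a}_x\,ds = \int_0^{2\pi} \left(\sigma_r\cos^2\theta - \tau_{r\theta}\sin^2\theta\right) r_1\,d\theta = \pi r_1\left(\sigma_r(r_1) - \tau_{r\theta}(r_1)\right). \tag{39}$$

The stress quantity

$$\sigma_r - \tau_{r\theta} = (\lambda + 2\mu)\,\frac{\partial U_r}{\partial r} - \mu\,\frac{\partial U_\theta}{\partial r} + (\lambda + \mu)\,\frac{U_r + U_\theta}{r} \tag{40}$$

equals

$$\left[(\lambda + \mu)\,\frac{2}{2+C} + 2\mu\right]\frac{1}{r}$$

for the first solution in eq. (37) and is identically 0 for the other three.
Therefore, when $\delta$ is set to 1,

$$\kappa = \pi\left[(\lambda_p + \mu_p)\,\frac{2}{2+C_p} + 2\mu_p\right] a_1, \tag{41}$$

where $p$ signifies the primary region and $a_1$ denotes the coefficient of
the first solution,

$$\begin{pmatrix} \ln(r/r_1) - \dfrac{C}{2+C} \\ -\ln(r/r_1) \end{pmatrix},$$

in eq. (37), in the primary region. Thus, of the eight unknown coefficients,
only one is needed for getting the spring constant.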
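The eight-equation system described in this appendix can be assembled and solved numerically. The Python sketch below is one possible arrangement, not the author's program. It assumes the four radial solutions (ln(r/r1) - C/(2+C), -ln(r/r1)), (1, -1), (r^-2, r^-2) and ((2-C)r^2, (2+3C)r^2), the stress amplitudes λ(U_r' + (U_r+U_θ)/r) + 2μU_r' and μ(U_θ' - (U_r+U_θ)/r), and lengths normalized by r1. The function names `layer_basis` and `spring_constant` and all material values are hypothetical.

```python
import numpy as np

def layer_basis(s, C, mu):
    """Amplitudes of the four assumed solutions at s = r/r1 in one layer.

    Returns (U_r, U_theta, sigma_r, tau_rtheta), each an array with one
    entry per solution; cos/sin angular factors are removed.
    """
    lam = mu * (C - 1.0)  # identical to lambda = 2*mu*nu*C with C = 1/(1-2*nu)
    f = np.array([np.log(s) - C / (2 + C), 1.0, s**-2, (2 - C) * s**2])
    g = np.array([-np.log(s), -1.0, s**-2, (2 + 3 * C) * s**2])
    df = np.array([1 / s, 0.0, -2 * s**-3, 2 * (2 - C) * s])
    dg = np.array([-1 / s, 0.0, -2 * s**-3, 2 * (2 + 3 * C) * s])
    sigma = lam * (df + (f + g) / s) + 2 * mu * df
    tau = mu * (dg - (f + g) / s)
    return f, g, sigma, tau

def spring_constant(r1, r2, r3, mu_p, nu_p, mu_s, nu_s):
    """Force on the fiber per unit length and unit deflection (sign as in eq. 39).

    r1, r2, r3: fiber, primary and secondary outer radii (any common unit).
    mu_*, nu_*: shear modulus and Poisson ratio (nu < 0.5) of each layer.
    """
    C_p = 1.0 / (1.0 - 2.0 * nu_p)
    C_s = 1.0 / (1.0 - 2.0 * nu_s)
    s2, s3 = r2 / r1, r3 / r1
    A = np.zeros((8, 8))
    b = np.zeros(8)
    # Fiber surface (s = 1): unit deflection gives (U_r, U_theta) = (1, -1).
    f, g, _, _ = layer_basis(1.0, C_p, mu_p)
    A[0, :4], A[1, :4] = f, g
    b[0], b[1] = 1.0, -1.0
    # Interface (s = s2): displacement and stress amplitudes are continuous.
    inner = layer_basis(s2, C_p, mu_p)
    outer = layer_basis(s2, C_s, mu_s)
    for row in range(4):
        A[2 + row, :4] = inner[row]
        A[2 + row, 4:] = -outer[row]
    A[4:6] /= mu_p  # rescale the two stress rows for better conditioning
    # Outer surface (s = s3): displacement is zero.
    f, g, _, _ = layer_basis(s3, C_s, mu_s)
    A[6, 4:], A[7, 4:] = f, g
    a1 = np.linalg.solve(A, b)[0]
    # Only the logarithmic solution contributes to sigma_r - tau_rtheta.
    return 4.0 * np.pi * mu_p * (C_p + 1.0) / (C_p + 2.0) * a1

# Illustrative values only: radii in micrometers, moduli in MPa.
kappa = spring_constant(62.5, 95.0, 125.0, 1.0, 0.49, 500.0, 0.35)
```

The result carries the units of the moduli and does not depend on the length unit, since only radius ratios enter. Take the magnitude if a positive stiffness is wanted.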
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The beam propagation method, based on the parabolic approximation to 
the wave equation, is used in conjunction with Papoulis’ redefinition for 
optical fields of Woodward’s ambiguity function. A simple derivation is given 
of Tatarskii’s formula for the lateral coherence function, and hence the mean 
intensity profile, of a laser beam propagating through a turbulent atmosphere. 
Statistics of the received signal and the effects of spatial nonstationarity of 
the turbulence can also be deduced using this technique, as can the effects of 
very large-scale variations in refractive index and receiver directivity. 


I. INTRODUCTION 


There has been a recent revival of interest in the propagation of 
laser beams through the atmosphere for communication purposes. 
King et al.’ have conducted experiments that show that a laser is an 
effective standby substitute for a microwave link over a clear-air, line- 
of-sight path of several tens of kilometers. When the microwave link 
is subject to severe multipath fading, the laser signal is found to be 
much more stable. In these clear-air conditions the laser beam is 
mainly affected by the atmosphere’s turbulence, which produces a 
spread of the propagating beam in excess of that expected due to 
diffraction. Over a 37-km path the lateral intensity profile of the laser 
beam is found to be random but to have an average Gaussian shape 
with a spread of about 6 m between $e^{-2}$ points.

More than two decades of research on the theory of optical propa- 
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gation through random media has been very competently reviewed by 
Strohbehn and others;” the chapter by Ishimaru? is particularly rele- 
vant here. The wide variety of approaches taken by different authors 
is apparent, ranging from the purely physical to the highly mathemat- 
ical. The present paper seeks to produce a reasonably simple theoret- 
ical picture, which is accurate both physically and mathematically, 
and which will also be useful for engineering purposes. 

The starting point, as so often elsewhere, is the parabolic approxi- 
mation to the wave equation,’ but it is used here in a manner that has 
become known as the “beam propagation method.”° Other names for 
the method are the “split-step Fourier technique” of Tappert and 
Hardin® and the “multiple random phase-screen method.””*® The es- 
sential idea that makes a simple solution possible is that, because the 
fluctuations in refractive index are so weak and their scale size is so 
large compared to the wavelength, the phenomena of diffraction and 
scattering can be artificially separated. The propagation path is divided 
into many short sections, so that the propagating wave is barely 
disturbed by each section, but their cumulative effect can be consid- 
erable. 

In each of these sections the irregularities are effectively removed, 
in the form of an accumulated random phase, to one or another of the 
boundary planes. Free-space diffraction is then allowed to occur within 
the now uniform section between the planes. The essential next step 
is that the Fresnel diffraction, which occurs between the planes in 
each of the sections, is described by what Papoulis® has called the 
“ambiguity function,” after the name given by Woodward" to a similar 
function of fundamental importance in radar. For an optical field the 
ambiguity function was redefined by Papoulis as the Fourier transform 
of the lateral mutual coherence function (i.e., the lateral autocorrela- 
tion function of the field). The “field ambiguity function” so defined 
has the very useful property that it propagates in a uniform medium 
without changing its functional form. What does change is the argu- 
ment, in a manner reminiscent of a wave traveling along a transmission 
line. 

On encountering the artificially accumulated random phase at each 
boundary plane, the field ambiguity function is modified appropriately 
but then propagates through the next section again without change. 
This procedure can continue as long as the propagating beam is 
essentially forward scattered, which is true for a laser beam propagat- 
ing through atmospheric turbulence at least out to 50 km, if not 
farther. Then over the final plane the lateral field autocorrelation 
function is obtained by taking the Fourier transform of the field 
ambiguity function. The mean intensity of the beam is contained in 
the field autocorrelation function as a special case. 
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It is gratifying that application of this simple procedure reproduces 
precisely what Ishimaru’ has described as “Tatarskii’s exact result”* 
for the lateral mutual coherence function of a laser beam propagating 
through atmospheric turbulence. In fact, because of its underlying 
physical clarity, the present procedure allows one to go a little farther 
than Tatarskii and describe the statistics of the propagating field in 
more detail, and also to deal with spatially nonstationary turbulence. 


II. ELECTROMAGNETIC BASIS: THE BEAM PROPAGATION METHOD

In attempting to solve propagation problems in which the scale size 
of the refractive-index irregularities is large compared with the wave- 
length, and the magnitude of these fluctuations is very small, it is 
often helpful to factor out the term exp(— jkz), assuming propagation 
in the general direction of the z axis. This procedure is analogous to 
factoring out the time dependence exp(jwt). Thus, any phasor com- 
ponent f(r, t) of the quasi-monochromatic propagating field can be 
written as 


f(r, t) = u(r, t)exp(— jkz), (1) 


where u(r, t) is a slowly varying phasor function of position r and 
time t. The propagation constant k is some convenient mean value. 

As a consequence of the assumed large scale size and tenuous nature 
of the refractive index irregularities, it can be shown*" that the 
function u(r, t) is governed by the parabolic equation approximation
to the wave equation:

$$\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} - j2k\,\frac{\partial u}{\partial z} + 2k^2 n_1 u = 0, \tag{2}$$

where $n_1(\mathbf{r}, t)$ is the departure of the refractive index from its mean
value, which will be assumed to be unity. Time fluctuations will be
ignored in what follows, although they can be incorporated easily if
required.

Consider the time-invariant solution of eq. (2) for a wave launched 
from the plane z = 0. A well-established approach®*"! is to solve the 
equation in two iterative steps, and will be referred to here as the 
beam propagation method. First assume that there are no variations 
in refractive index, so that $n_1 = 0$. Then the solution for the field over
any plane z is given by Fresnel’s diffraction formula”

$$f(x, y, z) = \frac{j\exp(-jkz)}{\lambda z}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} f(x', y', 0)\,\exp\left\{-\frac{jk}{2z}\left[(x - x')^2 + (y - y')^2\right]\right\}dx'\,dy' \tag{3}$$

in terms of the field f(x, y, 0) over the aperture plane z = 0.
If diffraction now is ignored, by suppressing the first two terms in
eq. (2), and if the irregularities in refractive index are restored, their
first-order effect can be obtained from the solution of

$$\frac{\partial u}{\partial z} + jkn_1 u = 0, \tag{4}$$

which is, by straightforward integration,

$$u(x, y, z) = u(x, y, 0)\exp\{j\Phi(x, y)\}, \tag{5}$$

where

$$\Phi(x, y) = -k\int_0^z n_1(x, y, z')\,dz'. \tag{6}$$


This is simply the phase induced by the refractive-index irregularities 
along straight-line paths, parallel to the z-axis, from 0 to z. 

Thus the solution of eq. (2) is in the two artificially separated parts 
given by eqs. (3) and (5). The first part allows for diffraction but
suppresses the effect of the irregularities, while the second part sup- 
presses diffraction but allows for the effect of the irregularities. The 
two parts of the solution then must be combined in some suitable way, 
as illustrated in the following examples. The first is concerned with 
the nonrandom effect of the overall linear trend of refractive index 
with height, and the second considers the effect of the turbulence- 
induced, small-scale random irregularities in refractive index. 
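The alternation of a diffraction step and a phase step can be sketched numerically. The Python example below is a minimal one-dimensional illustration and not the procedure used in this paper: it applies the free-space part of eq. (2) in the Fourier domain and then multiplies by exp(jΦ) at the exit face of each section. The function name `split_step`, the grid, and the white-noise phase screens are assumptions chosen for brevity; realistic screens would be spatially correlated.

```python
import numpy as np

def split_step(u0, x, k, dz, n_steps, phase_screens=None):
    """Propagate a 1-D slowly varying field u(x) through n_steps sections."""
    dx = x[1] - x[0]
    kappa = 2 * np.pi * np.fft.fftfreq(x.size, d=dx)  # lateral wavenumbers
    # Free-space step from d2u/dx2 - j*2k*du/dz = 0 (eq. 2 with n1 = 0).
    diffract = np.exp(1j * kappa**2 * dz / (2 * k))
    u = u0.astype(complex)
    for i in range(n_steps):
        u = np.fft.ifft(np.fft.fft(u) * diffract)     # diffraction only
        if phase_screens is not None:
            u = u * np.exp(1j * phase_screens[i])     # accumulated phase, eq. (5)
    return u

# Free-space check with a Gaussian beam (wavelength and waist as quoted
# for the King et al. experiment; the grid is an arbitrary choice).
wavelength = 0.63e-6
k = 2 * np.pi / wavelength
w0 = 0.08
x = np.linspace(-2.0, 2.0, 4096, endpoint=False)
u0 = np.exp(-x**2 / w0**2)
z = 32e3
u = split_step(u0, x, k, dz=z / 32, n_steps=32)

intensity = np.abs(u) ** 2
# For intensity ~ exp(-2 x^2 / w^2) the second moment of x is w^2 / 4.
w_numeric = 2 * np.sqrt(np.sum(x**2 * intensity) / np.sum(intensity))
w_theory = w0 * np.sqrt(1 + (wavelength * z / (np.pi * w0**2)) ** 2)
```

With no phase screens the numerical beam radius should agree closely with the standard Gaussian-beam spreading law, which gives some confidence in the diffraction step before random phases are added.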


2.1 Effect of linear gradient in refractive index 


If the refractive index varies linearly with height, with constant
gradient g, then over a distance $\Delta z$ eq. (6) gives the phase variation
with height x as linear also, namely $\Phi(x) = -kgx\,\Delta z$. The mean
linear trend in the atmosphere is usually negative, and so the effect
of this negative gradient in refractive index is to tilt the advancing
wavefront forward through an angle $g\,\Delta z$. If this continuous forward
tilt is interpreted as bending the propagating beam, giving it a radius
of curvature R, then the angle of tilt would be $\Delta z/R$. Hence
$R = g^{-1}$, a well-known result.!? But it should be emphasized that while
the beam is being bent it is also experiencing diffraction, according to
eq. (3), and so it spreads as it bends.


2.2 Effect of turbulent fine structure 


Having seen that the beam-bending effect of the mean linear trend
in refractive index can be treated separately, consider now a medium
that is on average uniform but whose refractive index $n_1(\mathbf{r})$ is a
zero-mean random process. The magnitude of these fluctuations in
refractive index is typically $10^{-6}$, for homogeneous turbulence conditions,
with scale sizes of at least several millimeters, which is large compared
to optical wavelengths. Hence the conditions for the beam propagation
method to apply are fulfilled.

Consider a segment of the medium between the beam-launch plane 
z = 0 and the plane z = Az. If the refractive-index irregularities are 
temporarily ignored, the field over the exit plane z = Az would be 
given by Fresnel’s diffraction formula of eq. (3). Now restoring the 
irregularities but temporarily suppressing diffraction, their effect is 
accounted for in the accumulated random phase along parallel ray 
paths of length Az: 


$$\Phi(x, y) = -k\int_0^{\Delta z} n_1(x, y, z)\,dz. \tag{7}$$


The statistics of this random phase process will be important later 
and so are derived here. 

Since the refractive-index fluctuation process $n_1(x, y, z)$ is zero
mean,

$$\langle\Phi(x, y)\rangle = 0, \tag{8}$$

where the angle brackets indicate taking the expectation.
If $n_1$ is wide-sense stationary, with autocovariance

$$B_{n_1}(\xi, \eta, \zeta) = \langle n_1(x, y, z)\,n_1(x + \xi, y + \eta, z + \zeta)\rangle \tag{9}$$

and variance

$$\sigma_{n_1}^2 = B_{n_1}(0, 0, 0), \tag{10}$$

then the autocovariance of the phase process is

$$B_\Phi(\xi, \eta) = k^2\int_0^{\Delta z}\int_0^{\Delta z} \langle n_1(x, y, z)\,n_1(x + \xi, y + \eta, z')\rangle\,dz\,dz'. \tag{11}$$

This is equivalent to’

$$B_\Phi(\xi, \eta) = k^2\Delta z\int_{-\Delta z}^{\Delta z} B_{n_1}(\xi, \eta, \zeta)\,d\zeta. \tag{12}$$

Now it is convenient to assume that the width of the section $\Delta z \gg \zeta_0$,
the scale size of the irregularities in the z direction, and so

$$B_\Phi(\xi, \eta) = k^2\Delta z\int_{-\infty}^{\infty} B_{n_1}(\xi, \eta, \zeta)\,d\zeta. \tag{13}$$

It also follows from the condition $\Delta z \gg \zeta_0$ that the phase over the
exit plane can always be taken to be Gaussian, whether the refractive
index itself is Gaussian or not, as a consequence of the central limit
theorem. Hence, the random phase process is completely described by
the autocovariance of eq. (13), and its variance can be taken to be of
the order

$$\sigma_\Phi^2 \approx k^2\sigma_{n_1}^2\zeta_0\,\Delta z, \tag{14}$$

in which $\zeta_0$ is sometimes referred to as the integral scale size of the
refractive index in the direction of propagation.

Since the beam propagation method is, in essence, a perturbation
technique, applied locally, it is important that over each section the
phase variance $\sigma_\Phi^2 \ll 1$. In applications such as laser beam propagation
through atmospheric turbulence, this condition is easily met, even
with the constraint that $\Delta z \gg \zeta_0$.
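The two constraints on the section width, a width much larger than the longitudinal scale size and a per-section phase variance much smaller than unity, can be checked with a few lines of arithmetic. The Python sketch below evaluates the order-of-magnitude estimate k² σ_n² ζ0 Δz. The function name and the values of the index fluctuation and of the scale size are illustrative assumptions, not figures taken from this paper.

```python
import math

def section_phase_variance(wavelength, sigma_n, zeta0, dz):
    """Order-of-magnitude phase variance k^2 * sigma_n^2 * zeta0 * dz for one section."""
    k = 2 * math.pi / wavelength
    return k**2 * sigma_n**2 * zeta0 * dz

wavelength = 0.63e-6   # He-Ne wavelength in meters
sigma_n = 1e-7         # assumed rms refractive-index fluctuation
zeta0 = 0.01           # assumed longitudinal scale size in meters
dz = 1.0               # section width of the order of a meter

variance = section_phase_variance(wavelength, sigma_n, zeta0, dz)
width_ratio = dz / zeta0   # should be much larger than 1
```

Because the variance grows linearly with the section width and quadratically with the fluctuation level, a stronger assumed fluctuation would call for a narrower section to keep the perturbation condition satisfied.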


III. STATISTICAL FIELDS: THE AMBIGUITY FUNCTION


The example of Section 2.2 will now be our main concern. So far 
we have the field distribution over the plane $z = \Delta z$ as
$f_0(x, y, \Delta z)\exp\{j\Phi(x, y)\}$, where $f_0(x, y, \Delta z)$ is the free-space
diffraction of the original aperture field and $\Phi(x, y)$ is the random phase
process of eq. (7). If the field over the plane $\Delta z$ is now allowed to diffract, Fresnel’s
diffraction formula would give the field over the next plane, assuming 
that the intervening region is free space. The irregularities could then 
be replaced and a different random phase process could account for 
them over this next plane. This new field then can be allowed to 
further diffract, and so on. 

Over some plane z, well into the random medium, the phasor field 
component f(x, y, z) will itself be random. The property of greatest 
utility would be its lateral mutual coherence function: 


$$\Gamma(x, y; \xi, \eta; z) = \langle f^*(x - \xi/2,\, y - \eta/2,\, z)\,f(x + \xi/2,\, y + \eta/2,\, z)\rangle, \tag{15}$$

otherwise referred to as the lateral field autocorrelation function. (The
asterisk denotes complex conjugate.) Hence the mean intensity

$$\langle I(x, y, z)\rangle = \langle|f(x, y, z)|^2\rangle = \Gamma(x, y; 0, 0; z). \tag{16}$$

However, $\Gamma(\ )$ is not simple to calculate,” whereas its Fourier transform
is. Papoulis® introduced the Fourier transform of $\Gamma(\ )$, calling it
the ambiguity function of the optical field, and showed that it greatly
simplified the calculation of Fresnel diffracted fields. In particular, a
very useful property of the ambiguity function is that it propagates
without changing its functional form in a uniform medium; what does
change is the argument of the function.

The definition of ambiguity function that will be used here, for the
field over the plane z, is

$$A(\mu, \nu; \xi, \eta; z) = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \Gamma(x, y; \xi, \eta; z)\exp\{jk(\mu x + \nu y)\}\,dx\,dy. \tag{17}$$

If the medium between this plane z and the plane $z + \Delta z$ is uniform,
it can be shown that, under conditions when Fresnel diffraction occurs,
the ambiguity function over the plane $z + \Delta z$ is simply?

$$A(\mu, \nu; \xi, \eta; z + \Delta z) = A(\mu, \nu; \xi - \mu\Delta z,\, \eta - \nu\Delta z;\, z). \tag{18}$$


The way in which the ambiguity function and this relation are used is 
described in the next two sections. 


IV. BEAM WAVE PROPAGATION THROUGH TURBULENCE 


Consider a beam of radiation launched into a turbulent medium in 
which the conditions for the application of the beam propagation 
method are satisfied, namely that the magnitude of the fluctuations 
in refractive index is very small and their scale size is very large in 
comparison with the wavelength of the propagating beam. If the field 
is launched from an aperture plane at z = 0, over which it is f(x, y, 0), 
the ambiguity function over the z = 0 plane is given by eqs. (15) and 
(17) as 


$$A(\mu, \nu; \xi, \eta; 0) = A_0(\mu, \nu; \xi, \eta) \tag{19}$$

$$= \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} f^*(x - \xi/2,\, y - \eta/2,\, 0)\,f(x + \xi/2,\, y + \eta/2,\, 0)\exp\{jk(\mu x + \nu y)\}\,dx\,dy. \tag{20}$$

If the propagation path is divided into short sections of width $\Delta z_i$,
$i = 1, 2, \ldots, N$, and if it is assumed that over each section the medium
is uniform, with the effect of the irregularities swept forward onto the
exit face, then the ambiguity function just to the left of the plane
$z = \Delta z_1$ is

$$A(\mu, \nu; \xi, \eta; \Delta z_1) = A_0(\mu, \nu; \xi - \mu\Delta z_1,\, \eta - \nu\Delta z_1) \tag{21}$$

with eq. (20) substituted.
The accumulated phase can be inserted at this point by writing

$$f(x, y, \Delta z_1) = f_0(x, y, \Delta z_1)\exp\{j\Phi(x, y)\}, \tag{22}$$

where the phase $\Phi(x, y)$ is given by eq. (7) and has the statistical
properties derived in Section 2.2. Now assuming that the $\Phi(x, y)$
process is stationary, over the lateral extent of the beam within the
first section, it follows that the ambiguity function at the exit face of
the section is

$$A(\mu, \nu; \xi, \eta; \Delta z_1) = A_0(\mu, \nu; \xi - \mu\Delta z_1,\, \eta - \nu\Delta z_1)\exp\{B_{\Phi 1}(\xi, \eta) - B_{\Phi 1}(0, 0)\}, \tag{23}$$

where the phase autocovariance function $B_{\Phi 1}(\xi, \eta)$ is given by eq. (13),
and the argument of the exponential is sometimes referred to as the
phase structure function. So

$$B_{\Phi 1}(\xi, \eta) = \sigma_{\Phi 1}^2\,\rho(\xi, \eta) \tag{24}$$

with the phase variance over this first section given by

$$\sigma_{\Phi 1}^2 = k^2\sigma_{n_1}^2\zeta_0\,\Delta z_1, \tag{25}$$

where $\sigma_{n_1}^2$ is the variance of the refractive index fluctuations, $\zeta_0$ is its
scale size in the direction of propagation, and $\rho(\ )$ is the normalized
phase autocovariance function. Thus eq. (23) is equivalently

$$A(\mu, \nu; \xi, \eta; \Delta z_1) = A_0(\mu, \nu; \xi - \mu\Delta z_1,\, \eta - \nu\Delta z_1)\exp\{-\sigma_{\Phi 1}^2[1 - \rho(\xi, \eta)]\}. \tag{26}$$

It will be recalled from Section 2.2 that while $\Delta z_1 \gg \zeta_0$, it is to be kept
small enough for $\sigma_{\Phi 1}^2 \ll 1$.

In the next section, of width $\Delta z_2$, again artificially separating the
phenomena of scattering and diffraction, by analogy with eq. (26)

$$A(\mu, \nu; \xi, \eta; \Delta z_1 + \Delta z_2) = A(\mu, \nu; \xi - \mu\Delta z_2,\, \eta - \nu\Delta z_2;\, \Delta z_1)\exp\{-\sigma_{\Phi 2}^2[1 - \rho(\xi, \eta)]\}, \tag{27}$$

in which it has been assumed that the turbulence is statistically
uniform along the beam. (This condition can be relaxed, and clearly
ought to be in some circumstances, but at the expense of greater
complication.) Combining eqs. (26) and (27) gives

$$A(\mu, \nu; \xi, \eta; \Delta z_1 + \Delta z_2) = A_0(\mu, \nu; \xi - \mu[\Delta z_1 + \Delta z_2],\, \eta - \nu[\Delta z_1 + \Delta z_2])\,\exp\{-\sigma_{\Phi 2}^2[1 - \rho(\xi, \eta)]\}\,\exp\{-\sigma_{\Phi 1}^2[1 - \rho(\xi - \mu\Delta z_2,\, \eta - \nu\Delta z_2)]\}. \tag{28}$$

This argument can be continued out to a distance

$$z = \sum_{i=1}^{N} \Delta z_i, \tag{29}$$

provided only that the propagation is essentially in the forward direction.
This is ensured by the large scale size of the turbulence compared
to the wavelength, and the small magnitude of the refractive index
fluctuations. Then the ambiguity function at the plane z is

$$A(\mu, \nu; \xi, \eta; z) = A_0(\mu, \nu; \xi - \mu z,\, \eta - \nu z)\exp\left\{-\sum_{n=1}^{N}\sigma_{\Phi n}^2\left[1 - \rho\left(\xi - \mu\sum_{m=n+1}^{N}\Delta z_m,\ \eta - \nu\sum_{m=n+1}^{N}\Delta z_m\right)\right]\right\}, \tag{30}$$

in which

$$\sigma_{\Phi n}^2 = k^2\sigma_{n_1}^2\zeta_0\,\Delta z_n \tag{31}$$

and with eq. (20) substituted.

The ambiguity function of eq. (30) can be written in integral form
if the $\Delta z_n$ can be taken to be sufficiently small. In the case of a He-Ne
laser beam propagating through a turbulent atmosphere, $\Delta z_n$ can
be of the order of a meter, which is small enough when it is realized
that both $\mu$ and $\nu$ are always small. The assumption of the statistical
uniformity of the turbulence throughout the length of the path is still
maintained, and so

$$A(\mu, \nu; \xi, \eta; z) = A_0(\mu, \nu; \xi - \mu z,\, \eta - \nu z)\exp\left\{-k^2\sigma_{n_1}^2\zeta_0\left[z - \int_0^z \rho(\xi - \mu z',\, \eta - \nu z')\,dz'\right]\right\}. \tag{32}$$


Finally, the lateral mutual coherence function is given by the inverse 
Fourier transform of eq. (17) as 


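$$\Gamma(x, y; \xi, \eta; z) = \left(\frac{k}{2\pi}\right)^2\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} A(\mu, \nu; \xi, \eta; z)\exp\{-jk(\mu x + \nu y)\}\,d\mu\,d\nu \tag{33}$$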


with eq. (32) substituted. This result is identical in form to Tatarskii’s 
exact solution of the differential equation for the mutual coherence 
function, quoted by Ishimaru.’ This agreement is encouraging, in that 
Tatarskii’s approach* is basically more rigorous but is physically 
somewhat obscure, whereas the present method keeps the physical 
meaning of the mathematics well to the fore. As an example, the 
method given here avoids Tatarskii’s assumption that the refractive 
index is delta-function correlated in the direction of propagation, 
which is a physical impossibility. In fact it seems that this so-called 
Markov approximation means physically that the induced phase proc- 
esses in each section are statistically independent. Also, the summa- 
tion form of the ambiguity function of eq. (30) could be advantageous 
in applications to paths along which the turbulence is nonstationary. 
Incidentally, the mean field obtained from eq. (22), by taking its
expectation, is

$$\langle f(x, y, \Delta z_1)\rangle = f_0(x, y, \Delta z_1)\exp\left\{-\tfrac{1}{2}\sigma_{\Phi 1}^2\right\}, \tag{34}$$

where $f_0(\ )$ is the field diffracted from the original aperture field in
the absence of irregularities. Carrying out the same procedure over the
N sections of the path gives

$$\langle f(x, y, z)\rangle = f_0(x, y, z)\exp\left\{-\tfrac{1}{2}k^2\sigma_{n_1}^2\zeta_0 z\right\}, \tag{35}$$

which is also referred to as the coherent part of the field. Note that
when the total accumulated phase variance

$$\sigma_{\Phi z}^2 = k^2\sigma_{n_1}^2\zeta_0 z \tag{36}$$

becomes much larger than unity, the coherent field becomes negligible.
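The exponential factor in the mean field is the familiar average of exp(jΦ) over a zero-mean Gaussian phase, which equals exp(-σ²/2). A short Monte Carlo experiment illustrates this. The Python sketch below is only a numerical illustration under that Gaussian assumption; the function name, sample size, and seed are arbitrary choices.

```python
import numpy as np

def coherent_factor_mc(phase_variance, n_samples=200_000, seed=0):
    """Monte Carlo estimate of <exp(j*Phi)> for zero-mean Gaussian Phi."""
    rng = np.random.default_rng(seed)
    phi = rng.normal(0.0, np.sqrt(phase_variance), n_samples)
    return np.mean(np.exp(1j * phi))

phase_variance = 0.5
estimate = coherent_factor_mc(phase_variance)   # complex, imaginary part near 0
exact = np.exp(-phase_variance / 2)             # closed form for Gaussian phase
```

The estimate approaches the closed form as the sample size grows, and repeating the experiment with a large variance shows the coherent part shrinking toward zero.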

Equations (35) and (33) give the first and second statistical moments,
respectively, of the propagating field. The physical mechanism
also strongly suggests, as a consequence of the central limit theorem, 
that the field is complex-Gaussian distributed. The statistical descrip- 
tion of the random propagating field is therefore complete. 


V. PROPAGATION OF A LASER BEAM THROUGH ATMOSPHERIC 
TURBULENCE 


To find the mean intensity and other characteristics of a laser beam 
propagating through a turbulent atmosphere, it will be assumed that 
the beam is launched in its fundamental mode with a plane wave front, 
that is, 


$$f(x, y, 0) = f_0\exp\left\{-\frac{x^2 + y^2}{w_0^2}\right\}, \tag{37}$$

where $w_0$ is the beamwaist parameter and $f_0$ is the complex amplitude
at the center of the beam. The ambiguity function for this field is,
from eq. (20),

$$A_0(\mu, \nu; \xi, \eta) = \frac{\pi w_0^2|f_0|^2}{2}\exp\left\{-\frac{k^2 w_0^2}{8}(\mu^2 + \nu^2)\right\}\exp\left\{-\frac{\xi^2 + \eta^2}{2w_0^2}\right\}. \tag{38}$$

Equations (32) and (33) then yield the lateral mutual coherence
function in this case as
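$$\Gamma(x, y; \xi, \eta; z) = \frac{k^2 w_0^2|f_0|^2}{8\pi}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\exp\left\{-\frac{k^2 w_0^2}{8}(\mu^2 + \nu^2)\right\}\exp\left\{-\frac{(\xi - \mu z)^2 + (\eta - \nu z)^2}{2w_0^2}\right\}$$

$$\cdot\exp\left\{-k^2\sigma_{n_1}^2\zeta_0\left[z - \int_0^z \rho(\xi - \mu z',\, \eta - \nu z')\,dz'\right]\right\}\exp\{-jk(\mu x + \nu y)\}\,d\mu\,d\nu, \tag{39}$$

and the mean intensity is obtained from this by setting $\xi$ and $\eta$ equal
to 0.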

It is easily verified as a check, using a standard integral, that 
eq. (39) in the absence of any irregularities in refractive index (i.e., 
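$\sigma_{n_1} = 0$) gives the correct intensity formula for free-space propagation
of a laser beam, namely,

$$I_0(x, y, z) = |f_0|^2\,\frac{w_0^2}{w^2(z)}\exp\left\{-\frac{2(x^2 + y^2)}{w^2(z)}\right\}, \tag{40}$$

where the beamwaist parameter

$$w(z) = w_0\sqrt{1 + \left(\frac{\lambda z}{\pi w_0^2}\right)^2} \tag{41}$$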

depends on distance. The particular distance

$$z_F = \frac{\pi w_0^2}{\lambda} \tag{42}$$

usefully indicates the transition from the near field ($z \ll z_F$), when
the beam is essentially collimated, to the far field ($z \gg z_F$), when the
beam spreads out linearly with distance. In the experiment of King et
al.,1 for example, with the beamwaist at launch $w_0 = 8$ cm and the
wavelength 0.63 μm, the distance $z_F = 32$ km.
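The quoted transition distance follows directly from the beamwaist and wavelength. The Python sketch below evaluates the distance πw0²/λ and the free-space spreading law w0·sqrt(1 + (z/z_F)²) for the launch parameters quoted above. The function names are illustrative, and the 37-km evaluation point is the path length mentioned in the Introduction.

```python
import math

def fresnel_distance(w0, wavelength):
    """Near-field to far-field transition distance pi * w0^2 / wavelength."""
    return math.pi * w0**2 / wavelength

def beam_radius(z, w0, wavelength):
    """Free-space beamwaist parameter w(z) = w0 * sqrt(1 + (z / z_F)^2)."""
    return w0 * math.sqrt(1 + (z / fresnel_distance(w0, wavelength)) ** 2)

w0 = 0.08             # beamwaist at launch, meters
wavelength = 0.63e-6  # meters

z_f = fresnel_distance(w0, wavelength)            # roughly 32 km
w_free_space = beam_radius(37e3, w0, wavelength)  # diffraction-only radius at 37 km
```

The diffraction-only radius at 37 km is of the order of a tenth of a meter, far smaller than the several-meter spread reported for the experiment, which is the excess the turbulence analysis sets out to explain.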

To obtain some analytical results for the mean intensity, it will be 
necessary to resort to some approximation. One way of doing this is 
to replace the variable z’ in the inner integral of eq. (39) by the 
constant z. Then the mean intensity, using eq. (16), is given approxi- 
mately by 


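$$\langle I(x, y, z)\rangle = \frac{k^2 w_0^2|f_0|^2}{8\pi}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\exp\left\{-\frac{k^2 w_0^2}{8}(\mu^2 + \nu^2)\right\}\exp\left\{-\frac{z^2(\mu^2 + \nu^2)}{2w_0^2}\right\}$$

$$\cdot\exp\{-\sigma_{\Phi z}^2[1 - \rho(-\mu z,\, -\nu z)]\}\exp\{-jk(\mu x + \nu y)\}\,d\mu\,d\nu, \tag{43}$$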
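in which $\sigma_{\Phi z}^2$ [see eq. (36)] is the total accumulated phase variance
over the length of the path. Equation (43) can also be written equivalently,
by substituting

$$\mu z = -\xi \quad\text{and}\quad \nu z = -\eta, \tag{44}$$

so that

$$\langle I(x, y, z)\rangle = \frac{k^2 w_0^2|f_0|^2}{8\pi z^2}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\exp\left\{-\left(\frac{k^2 w_0^2}{8z^2} + \frac{1}{2w_0^2}\right)(\xi^2 + \eta^2)\right\}$$

$$\cdot\exp\{-\sigma_{\Phi z}^2[1 - \rho(\xi, \eta)]\}\exp\left\{j\frac{k}{z}(\xi x + \eta y)\right\}d\xi\,d\eta. \tag{45}$$

If now the middle exponential in eq. (45) is written as the sum of
two parts, as

$$\exp\{-\sigma_{\Phi z}^2[1 - \rho(\xi, \eta)]\} = \exp\{-\sigma_{\Phi z}^2\} + \exp\{-\sigma_{\Phi z}^2\}\left[\exp\{\sigma_{\Phi z}^2\rho(\xi, \eta)\} - 1\right], \tag{46}$$

then the mean intensity formula of eq. (45) conveniently splits into
the sum of the coherent intensity and the incoherently scattered
intensity, namely,

$$\langle I(x, y, z)\rangle = I_0(x, y, z)\exp\{-\sigma_{\Phi z}^2\} + I_s(x, y, z), \tag{47}$$

in which $I_0(x, y, z)$ is given by eq. (40) and

$$I_s(x, y, z) = \frac{k^2 w_0^2|f_0|^2}{8\pi z^2}\exp\{-\sigma_{\Phi z}^2\}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\exp\left\{-\left(\frac{k^2 w_0^2}{8z^2} + \frac{1}{2w_0^2}\right)(\xi^2 + \eta^2)\right\}$$

$$\cdot\left[\exp\{\sigma_{\Phi z}^2\rho(\xi, \eta)\} - 1\right]\exp\left\{j\frac{k}{z}(\xi x + \eta y)\right\}d\xi\,d\eta. \tag{48}$$

It is interesting to examine the two extreme cases of $\sigma_{\Phi z}^2 \ll 1$ and
$\sigma_{\Phi z}^2 \gg 1$, which correspond respectively to short and long propagation
paths.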


5.1 Short paths 


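If, according to eq. (36), the path length z is short enough to make
$\sigma_{\Phi z}^2 \ll 1$, then eq. (47) shows that the coherent part will predominate.
Appropriate approximations in eq. (48) give the incoherent scattered
power in this case as

$$I_s(x, y, z) = \frac{k^2 w_0^2|f_0|^2}{8\pi z^2}\,\sigma_{\Phi z}^2\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\rho(\xi, \eta)\exp\left\{-\left(\frac{k^2 w_0^2}{8z^2} + \frac{1}{2w_0^2}\right)(\xi^2 + \eta^2)\right\}\exp\left\{j\frac{k}{z}(\xi x + \eta y)\right\}d\xi\,d\eta. \tag{49}$$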
If it is assumed that the initial laser beam width $w_0 \gg a$, where a is
the typical scale size of the turbulence (of the order of 1 cm in the
atmosphere), then the first exponential term in eq. (48) can be ignored
and the mean intensity profile of the scattered field is the Fourier
transform of $\rho(\xi, \eta)$.

Thus, over short paths the laser beam will be only slightly diminished
in comparison to the free-space situation, and the energy lost
will be scattered. When $w_0 \gg a$ there will be an intense central spot
surrounded by a faint halo of scattered light. The form of the scattered
intensity profile will be determined by the lateral correlation of the
turbulent refractive-index fluctuations. On the other hand, if $w_0 \ll a$
the laser beam will snake its way through the turbulence, preserving
its original profile but continually changing its direction in a random
manner.


5.2 Long paths 


If the path length is long enough to make $\sigma_{\Phi z}^2 \gg 1$, then according
to eq. (47) the coherent part will be negligible, and so eq. (45) can be
used directly to describe the now completely scattered field. Examining
the middle exponential term in the integrand of eq. (45) reveals that
for very large $\sigma_{\Phi z}^2$ its behavior will be dominated by the behavior of
$\rho(\xi, \eta)$ near the origin.’® For turbulence


P+ 7? 
2 





pl, n) = 1—- 5 ae (50) 
where a is now to be interpreted as the dissipation scale size of the 
assumed uniform and isotropic turbulence. (This assumption may not 
be true of course, but any naturally occurring refractive-index fluctua- 
tions will have an autocorrelation function that behaves parabolically 
in the neighborhood of the origin.*’) Substituting eq. (50) into eq. (45) 
gives the mean intensity profile as 


2 2 
ee 


1 3 
SL = (Gast seat E+ 


sep \ oe didn. (51) 


So, if $w_0 \gg a$, the $\sigma_{\Phi z}^2/a^2$ term dominates, and a standard integral
yields





CRS ec ee |- ay) (52) 


w?(z) w?(z) 
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where now 

w(z) = NON (53) 

Ta 

Equation (52) gives the mean intensity profile as being Gaussian in
shape, as observed in the experiment of King et al.’ The beamwaist
parameter depends on $z^{3/2}$, since $\sigma_{\Phi z}$ varies as $\sqrt{z}$. Some care must be
taken with the longitudinal integral scale $\zeta_0$ of the turbulence, which
is needed to evaluate $\sigma_{\Phi z}$ (see eq. 36). It is tempting to identify it with
the dissipation scale size a, used in eq. (50), but that is probably an
underestimate. On the other hand, Ishimaru’s identification of it with
the outer scale of turbulence!’ seems like an overestimate. The cautionary
remark at the end of Section 5.3 should also be noted.


5.3 Lateral field autocorrelation 


The autocorrelation of the propagating field over a plane is given 
by the lateral mutual coherence function of eq. (39). When the same 
approximation as for the intensity (eq. 43) is made, namely, replacing 
z’ in the inner integral of eq. (39) by z, the lateral mutual coherence 
function becomes 

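$$\Gamma(x, y; \xi, \eta; z) = \frac{k^2 w_0^2|f_0|^2}{8\pi}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\exp\left\{-\frac{k^2 w_0^2}{8}(\mu^2 + \nu^2)\right\}\exp\left\{-\frac{(\xi - \mu z)^2 + (\eta - \nu z)^2}{2w_0^2}\right\}$$

$$\cdot\exp\{-\sigma_{\Phi z}^2[1 - \rho(\xi - \mu z,\, \eta - \nu z)]\}\exp\{-jk(\mu x + \nu y)\}\,d\mu\,d\nu. \tag{54}$$

Making the substitutions $p = \xi - \mu z$, $q = \eta - \nu z$ gives

$$\Gamma(x, y; \xi, \eta; z) = \frac{\pi w_0^2|f_0|^2}{2\lambda^2 z^2}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\exp\left\{-\frac{k^2 w_0^2}{8z^2}\left[(\xi - p)^2 + (\eta - q)^2\right]\right\}\exp\left\{-\frac{p^2 + q^2}{2w_0^2}\right\}$$

$$\cdot\exp\{-\sigma_{\Phi z}^2[1 - \rho(p, q)]\}\exp\left\{-j\frac{k}{z}\left[(\xi - p)x + (\eta - q)y\right]\right\}dp\,dq. \tag{55}$$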


In the long-path limit, when $\sigma_{\Phi z}^2 \gg 1$, and making use of eq. (50),
the lateral mutual coherence function now becomes


T(x, y; & 93 2) = 
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2) ¢ 12 
P(x, ¥; & 03 2) = BWalIOl oo |-j teem} 


PD ar 2 


Le few {- Eve ig py se +- ai} 


-EXD |- ar + ze) (p? + hex 4i eo dpdq. (56) 


2w? 


Performing the integration, and expressing the result in terms of the 
long-path mean intensity of eq. (51), gives 


I'(x, y; & 0; 2) = exp |-; Meta} 





-exp |- waa (EP + n)} (I(x, y, x)). (67) 


It should be remembered, however, that this result is based on the 
approximation made in deriving eq. (54) from eq. (39). The physical 
significance of this approximation is now clear and is the following. 

In effect, the accumulated random phase along the path has been 
incorporated in a single random phase-changing screen placed just in 
front of the radiating laser aperture. This observation follows from a 
very useful result given in Ratcliffe,!® which is that the angular 
correlation function for the far field is the Fourier transform of the 
magnitude squared of the aperture field distribution. Using this result, 
one would expect the scattered field to be initially correlated over
angles of the order of $\lambda/(\pi w_0)$, but with increasing z, as the
magnitude of the field becomes more finely divided, the lateral correlation
scale size would be less than the $\lambda z/(\pi w_0)$ indicated by eq. (57).
This speculation is borne out by some computer simulations.’® A better 
approximation might be to replace the inner integral of eq. (39) by 
$z\rho(\xi - \mu z/2,\, \eta - \nu z/2)$. This would move the single equivalent random
phase-changing screen to a point midway along the path. However, 
neither approximation is particularly satisfactory, and it is obviously 
safer to use the unapproximated eq. (39), although this would probably 
require numerical evaluation. 


VI. CONCLUSIONS 


The beam propagation method, based on the parabolic approxima- 
tion to the wave equation, has been applied to the propagation of a 
laser beam through the clear but turbulent atmosphere. Papoulis’ 
extension to optical fields of Woodward’s ambiguity function was used. 
The resulting formula for the lateral mutual coherence function of the 
propagating laser beam, and hence its mean intensity profile, agrees 
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with that of Tatarskii. The method has the advantage over alternative 
approaches of greater physical clarity. Incorporation of the effects of 
very-large-scale irregularities and of receiver directivity is then very 
simple, as also is the estimation of signal statistics and allowing for 
the consequences of spatial nonstationarity of the turbulence. 
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Simple Analytical Representation of Antenna 
Spatial Radiation Patterns With Application to 
the Pyramidal Horn-Reflector Antenna 
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The spatial (three-dimensional) radiation characteristics of directive anten- 
nas are often needed in radio interference calculations and predictions. Pre- 
sented is a method that allows a fast computation of the spatial radiation 
envelope characteristics of antennas from measured pattern information. This 
is achieved by fitting the measured data to simple functional forms that are 
based on salient physical properties of the antennas. An example is given in 
which radiation envelopes for a pyramidal horn-reflector antenna, widely used 
in AT&T service, are calculated from measured data. Superpositions of quad- 
ratic functions to fit main radiation lobes and logarithmic functions to repre- 
sent the side-lobe envelopes are being used, and good agreement with the 
measured data is demonstrated. 


I. INTRODUCTION 


Ground scattering is a major source of interference in microwave 
communication links.’ Its analysis involves repeated computations 
incorporating the antenna directivity in different directions through- 
out its three-dimensional (3D) coverage. 

For a mathematical representation of the 3D directivity pattern of 
the antenna to be applicable for such purposes, it should be compatible 
with the data storage and computability constraints and commensur- 
ate with the accuracy requirements of the analysis package. Direct 
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representation of measured data, for instance, requires storage of 
densely sampled data, about 20 points per beamwidth in every mea- 
surement cut, and a fraction of a beamwidth separation between cuts, 
resulting in the immense number of 400,000 points in the data storage 
for a 1°-beamwidth antenna. Interpolation methods, based on the 
bandlimitedness of the antenna spatial spectrum, can reduce the 
required database by at least two orders of magnitude,”* but many 
applications require even further simplicity. Such a simplification can 
be offered by approximating the radiation pattern by its envelope only 
and disregarding the detailed side-lobe structure, which varies from 
one antenna unit to another of the same type and varies rapidly with 
frequency. 

The envelope surface, representing a local angular average (or peak 
cover) of the radiation pattern, is much smoother and more repeatable. 
Its generation still requires all side-lobe peaks, and any general pro- 
cedure of surface matching is not as straightforward and simple as 
desired. A major reduction in complexity may be achieved when use is 
made of salient features of the antenna. It is demonstrated, in what 
follows, that a complex surface may be well approximated to a high 
degree of accuracy with relatively simple analytic expressions by 
relying on basic antenna features. 

It is worth mentioning here that simple models, using a single skirt 
function, have found use for specifications,*” but their approximation 
is much too crude for other applications. 

The approximation to the radiation pattern of the Pyramidal Horn- 
Reflector (PHR) antenna,° widely used by AT&T Communications, is 
worked out as an example encompassing only four coefficients in each 
region, out of a lookup table with 29 constants, while maintaining 
tight match over the main beam and no more than 5-dB deviation 
from the side-lobe peaks throughout. This work was briefly summa- 
rized in Ref. 7. 


II. SURVEY OF PERTINENT ANTENNA FEATURES 
2.1 Symmetries in the antenna pattern 


The field distribution in a radiating aperture is transformed to the 
far field via the Fourier Transform (FT), 


$$F(u, v) \propto \int_{\text{aperture}} dx \int dy\, f(x, y)\, e^{jk(ux + vy)} \tag{1}$$


(see Fig. 1), where 
u = sin θ sin φ 
v = sin θ cos φ, 


and k, being the wave number, equals 2π/λ. 


1602 TECHNICAL JOURNAL, SEPTEMBER 1985 










Fig. 1—Coordinate systems for the pyramidal horn-reflector antenna. 


For any cut through the z axis (perpendicular to the aperture), one 
may rotate the coordinates to align the cut with the u and v axes and, 
thus, reduce eq. (1) to a one-dimensional FT 


$$F(u, 0) \propto \int_{\text{aperture}} dx\, e^{jkux} \int_{\text{aperture}} dy\, f(x, y), \tag{2}$$


where the symmetry rules of Table I apply. 

The aperture of the PHR antenna, for example, is tilted and not 
perpendicular to its main beam boresight (see Fig. 1). In the vertical 
plane, the aperture distribution is, therefore, not real, and the resulting 
radiation pattern not symmetrical. A simple computation technique 
by which the field is projected onto a virtual vertical aperture is widely 
used (see Refs. 8 through 10 in connection with the PHR antenna) 
and produces a symmetrical pattern in the vertical plane, which is 
obviously in error. In the horizontal plane, however, the aperture is 
perpendicular to the pattern boresight and is symmetrical. Further, 
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Table |—Aperture symmetry rules 


Rule   Aperture Distribution            Radiation Pattern 

1      f(x) real                        |F(u)| = |F(−u)| 
                                        arg F(u) = −arg F(−u) 
2      f(x) = ±f(−x)                    F(u) = ±F(−u) 
3      f(x, y) = f_x(x) f_y(y)          F(u, v) = F_x(u) F_y(v) 
         f_x ↔ F_x(u) 
         f_y ↔ F_y(v) 
4      f(ρ, φ') = f_n(ρ) e^{jnφ'}       F(u, v) = F_n(w) e^{jnΦ} 
         ρ² = x² + y²                     w² = u² + v² 
         φ' = tg⁻¹(x/y)                   Φ = tg⁻¹(u/v) 
         (Hankel transform) 


Table II—Asymptotic power density 
drop-off for rectangular distributions 

Rectangular Aperture      Asymptotic Power 
Distribution              Density Drop-Off 

Uniform                   u^-2 
Cosine                    u^-4 
(Cosine)^2                u^-6 
(Cosine)^n                u^-2(n+1) 
Taylor                    u^-2 
Dolph-Chebyshev           Constant 


with the side walls of the aperture tilted, the field is not separable 
(rule 3 in Table I), nor is the far field. 


2.2 Asymptotic drop-off of the radiation pattern envelope 


The FT relationship between the aperture field and the radiation 
pattern may be evaluated asymptotically for large u. By integrating 
by parts one gets 


$$F(u) = \int_a^b f(x)\, e^{jkux}\, dx \sim \sum_{n=0}^{\infty} \left(\frac{1}{ku}\right)^{n+1} f^{(n)}(x)\, e^{j[kux + (n-1)\pi/2]} \Big|_a^b, \tag{3}$$


which represents the radiation pattern as an asymptotic series of 
diffraction terms by discontinuities at the aperture edges. The leading 
term in that inverse power series is determined by the order of the 
derivative of the aperture illumination that is discontinuous at the 
edges. A representative list of aperture distributions, along with the 
resulting power density drop-off, is listed in Table II (for an extensive 
list, refer to Ref. 11). 
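
As a brief worked check of eq. (3), consider a uniform distribution f(x) = 1 over an aperture of width w (the symbol w and the centered integration limits are assumptions made only for this illustration). The integral can be done in closed form, and its envelope shows the u^-2 power drop-off associated with the uniform case:

```latex
F(u) = \int_{-w/2}^{w/2} e^{jkux}\,dx = \frac{2\sin(kuw/2)}{ku},
\qquad
|F(u)|^2 \le \frac{4}{(ku)^2} \propto u^{-2}.
```

For a cosine taper, f vanishes at both edges but its first derivative does not, so the n = 1 term of the series leads, the field falls as (ku)^-2, and the power envelope falls as u^-4.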

Phase-modulated aperture distributions (e.g., wide flare horns, 
shaped beam antennas) may have an additional, nondiffraction con- 
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Table III—Asymptotic power 
density drop-off for circular 
distributions 


Circular Aperture         Asymptotic Power 
Distribution              Density Drop-Off 

Uniform                   u^-3 
1 − (ρ/a)²                u^-5 
[1 − (ρ/a)²]²             u^-7 
[1 − (ρ/a)²]^n            u^-(2n+3) 
Circular Taylor           u^-3 


tribution resulting from saddle-point integration, which applies prin- 
cipally to the main beam (see, for example, Ref. 12). 

The radiation pattern of a planar aperture with rectangular sepa- 
rable distributions (rule 3 in Table I) is a product of the principal 
plane patterns. Polar separable distributions generate patterns with 
drop-off rate u-"""/ by integrating the Hankel transform? by parts 
and using asymptotic expressions for the Bessel functions. Represent- 
ative circular distributions are listed in Table III.1* Discrete element 
construction of the aperture distribution adds grating lobes to the 
antenna pattern when the elements are periodically displaced. The 
grating lobes are isolated, however, and their location can be predicted 
from the array structure. 

A wedge diffraction pattern is a product of axial and cross patterns, 
with the latter culminating at the shadow boundaries of the incident 
and reflected illuminations and decaying from it as (sin y/2)~' for a 
thin wedge or as (sin 37/2)": for a right-angle wedge,’ y being the 
angle from the shadow boundary. 

Diffraction by strips and cylinders is similarly a product of axial 
and cross patterns, where Snell reflection rules apply to the axial 
pattern. The diffraction pattern thus forms a cone around the axis, 
azimuthally and axially weighted by the respective pattern behavior. 


2.3 Reconstruction of the antenna 3D radiation envelope approximation 


The above survey shows simple pattern behavior for elemental 
radiators when represented in their natural coordinate systems. The 
generic form 


$$F(u)\,(\mathrm{dB}) = a - b \log_{10} u \tag{4}$$


may be used on each of the principal axes of the pattern of a separable 
distribution or edge diffraction and on representative radial cuts for a 
nonseparable distribution where azimuthal interpolation functions can 
close the gap. Paraboloids, circular or elliptical, are used to match the 
peak regions. 
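
A minimal sketch of how the coefficients a and b of eq. (4) might be matched to side-lobe peaks follows, assuming a plain least-squares fit in log10 u. The function name `fit_log_envelope` and the sample peak values are hypothetical; the paper does not state which fitting procedure was used, and a peak-cover envelope would additionally raise a until no peak exceeds the fitted line.

```python
import math

def fit_log_envelope(u_values, level_db):
    """Least-squares fit of level_db ~ a - b*log10(u); returns (a, b)."""
    x = [math.log10(u) for u in u_values]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(level_db) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, level_db))
    slope = sxy / sxx                 # dB per decade of u (negative for a fall-off)
    a = mean_y - slope * mean_x       # level extrapolated to u = 1
    return a, -slope                  # b is defined positive in eq. (4)

# Hypothetical side-lobe peaks falling at exactly 20 dB per decade.
u_peaks = [0.05, 0.1, 0.2, 0.4, 0.8]
peaks_db = [-30.0 - 20.0 * math.log10(u / 0.05) for u in u_peaks]
a, b = fit_log_envelope(u_peaks, peaks_db)
```

Because the synthetic peaks lie exactly on a 20 dB-per-decade line, the fit recovers b = 20 to within rounding; measured peaks would scatter around the fitted line.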
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Contributions to the antenna radiation pattern come from the 
illumination of the aperture and its edges, diffraction by the feed and 
structural members, and the weather cover. All these can be classified 
by the categories surveyed in the previous section, and the character- 
istic pattern of each can be traced on the 3D antenna pattern in 
regions where it dominates. These traces are easiest to identify on the 
sin θ, φ polar plot of the antenna pattern, pivoted around the boresight, 
where they take elliptical shapes. For instance, an edge slanted by an 
angle α from boresight (z axis) in a plane slanted by β from the xz 
plane, diffracts the outgoing wave on a cone, the trace of which is an 
ellipse with axes 


a = sin α 


b = (1/2) sin 2α (5) 




Fig. 2—Projections of the edge diffraction cone. 

tangent to a line through the boresight forming an angle β with the y 
axis (see Fig. 2). 
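
The axes of eq. (5) are simple to evaluate. The short sketch below (the function name `cone_trace_axes` is hypothetical) computes them for a given slant angle, here the 14.5° value quoted later for the side edges of the PHR antenna.

```python
import math

def cone_trace_axes(alpha_deg):
    """Axes of the elliptical trace of eq. (5) for an edge slanted by alpha."""
    alpha = math.radians(alpha_deg)
    a = math.sin(alpha)
    b = 0.5 * math.sin(2.0 * alpha)   # equals a * cos(alpha)
    return a, b

a_axis, b_axis = cone_trace_axes(14.5)
```

For small slant angles the two axes are nearly equal, so the trace on the sin θ, φ plot is close to a circle through the boresight.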

Once skeleton shape matching is obtained, parameters of eq. (4) are 
adjusted to match the envelope of each contributor in its natural 
coordinate system. The partial patterns are then retransformed to the 
antenna coordinate system where final patch up might be required in 
transition regions. 

The desired application of the radiation envelope approximation 
should determine the coordinates of representation and the transfor- 
mation formulas. In analyzing scattering interference in terrestrial 
transmission, for instance, the azimuth (AZ)-elevation (EL) coordi- 
nate system blends with the computations of the model much better 
than θ, φ coordinates, and the transformations and approximations 
are best done directly in that representation (see Fig. 1). 


III. THE PYRAMIDAL HORN-REFLECTOR ANTENNA PATTERN 


The PHR antenna is extensively used in terrestrial microwave links 
(see Fig. 3). The measured data of the frontal hemisphere of its 3D 
radiation pattern at 4 GHz consists of 91 φ cuts made every 1° with a 
sampling rate of 0.08°, totaling about 200,000 measured points.’ The 
sin θ, φ polar plot of its radiation distribution for horizontal polarization 
is shown in Fig. 4, truncated at 60 dB below the main beam peak, 
while the accompanying 3D pattern is shown in Fig. 5 in AZ-EL 
coordinates. The trace a in Fig. 4 could be matched to an ellipse with 
α = 14.5°, β = 14.5° and identified as side edge diffraction. The trace 
b, on the other hand, matches an ellipse with α = 3.6°, β = 14.5°, 
corresponding to the side blinder (see Fig. 3).’° A closer look at the 
side blinder attachment detail shows a step at the aperture edge, 
allowing for the aperture edge diffraction to dominate on one side and 
to be shadowed by the side blinder on the other. The large diffused 
lobe at d is due to reflection by the weather cover emerging down after 
a second bounce from the reflector (see Fig. 6), while the one at c is a 
spillover of the horn field illuminating the top edge of the reflector. 
The flare of the side-lobe ridge at the top and at the bottom is 
attributed to the curved top edge of the aperture. 

Fig. 3—The AT&T pyramidal horn-reflector antenna. 

Fig. 4—Radiation pattern for the pyramidal horn-reflector antenna of Fig. 3 in sin θ, φ 
coordinates at 3900 MHz, horizontal polarization. 

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Fig. 5—Radiation pattern for the pyramidal horn-reflector antenna in AZ-EL coor- 
dinates at 3900 MHz, horizontal polarization. 


The aperture field of the PHR antenna has the TE₁₀-mode distribution 
prevailing in the pyramidal horn, blown up by the reflector. 
The horizontally polarized field strongly illuminates the side walls but 
is much weaker at the upper and bottom edges. The case is reversed 
with vertical polarization as shown in Figs. 7 and 8; the side wall 
diffraction is highly suppressed, while that of the top and bottom 
edges is very strong at the center and decays to the sides. The spillover 
lobe, c, is much stronger, but the window lobe, d, is similar to that of 
the horizontal polarization. 


IV. THE RADIATION ENVELOPE APPROXIMATION FOR THE PHR 
ANTENNA 


Fig. 6—Pyramidal horn-reflector antenna with bottom-edge blinder and window lobe. 

The contributors to the radiation pattern, identified by examination 
of both the pattern and the antenna, are aperture illumination taper, 
top edge (curved), bottom edge, side edges (slanted α = 14.5°, β = 
14.5°), side blinders (slanted α = 3.6°, β = 14.5°), weather cover 
(window lobe), top-edge spillover, and bottom-edge blinder (optional). 
Each of these contributions is approximated by the following generic 
function in the peak regions: 


$$g = g_{\max} - K_u (u - u_0)^2 - K_v (v - v_0)^2, \tag{6}$$


where g_max is the peak level in decibels below the main beam peak, u 
and v are the local principal plane coordinates [see eq. (1)] for each 
contributor, and K_u, K_v are the parabolic coefficients to be matched. 
In the fall-off regions, 


$$g = a_u + a_v - b_u \log_{10} u - b_v \log_{10} v \tag{7}$$


supplies the envelope, with a_u, a_v, b_u, b_v coefficients to be matched. 
The local coordinate systems are then transformed to the antenna sin 
AZ, sin EL coordinate system for preserving computational economy 
in the repeated transformations between the antenna and the scattering 
model’ coordinates. Such a simplification is made possible by the 
fact that the antenna boresight is almost horizontal and its azimuthal 
plane aligns with that of the scattering model to enough accuracy. The 
sin AZ, sin EL coordinate system is presented in Fig. 1 and is related 
to the spherical coordinates via 


$$\sin EL = \sin\theta \sin\phi = u$$

$$\sin AZ = \sin\theta \cos\phi \,/\, \sqrt{1 - \sin^2\theta \sin^2\phi}$$

and 

$$\cos\theta = \cos AZ \cos EL$$

$$\sin\phi = \sin EL \,/\, \sqrt{1 - \cos^2 AZ \cos^2 EL}.$$

Fig. 7—Radiation pattern for the pyramidal horn-reflector antenna in sin θ, φ 
coordinates, vertical polarization. 


The top edge contribution is flared to ±15° by scaling the AZ coordinate 
input to eq. (7). 
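
The relations between the spherical and AZ-EL coordinates can be sketched as a pair of conversion functions. This is an illustration under stated assumptions: angles are in radians, the frontal hemisphere (|AZ| < 90°, |EL| < 90°) is assumed, the function names are hypothetical, and the expression used for cos φ is derived here from the same geometry rather than quoted from the text.

```python
import math

def _clamp(x):
    """Keep rounding errors from pushing a sine or cosine outside [-1, 1]."""
    return max(-1.0, min(1.0, x))

def spherical_to_az_el(theta, phi):
    """(theta, phi) about the boresight -> (AZ, EL), frontal hemisphere only."""
    sin_el = math.sin(theta) * math.sin(phi)
    cos_el = math.sqrt(1.0 - sin_el ** 2)          # nonzero for |EL| < 90 degrees
    sin_az = math.sin(theta) * math.cos(phi) / cos_el
    return math.asin(_clamp(sin_az)), math.asin(_clamp(sin_el))

def az_el_to_spherical(az, el):
    """(AZ, EL) -> (theta, phi); phi is returned in (-pi, pi]."""
    cos_theta = _clamp(math.cos(az) * math.cos(el))
    theta = math.acos(cos_theta)
    sin_theta = math.sqrt(1.0 - cos_theta ** 2)
    if sin_theta == 0.0:
        return 0.0, 0.0                            # on boresight phi is undefined
    sin_phi = math.sin(el) / sin_theta
    cos_phi = math.sin(az) * math.cos(el) / sin_theta   # derived, not from the text
    return theta, math.atan2(sin_phi, cos_phi)

az, el = spherical_to_az_el(0.3, 1.1)
theta_back, phi_back = az_el_to_spherical(az, el)
```

Using `atan2` with both sin φ and cos φ resolves the quadrant of φ, which the arcsine of sin φ alone would not.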

The partial contributions to the radiation envelope thus obtained 
are drawn in Figs. 9 through 12 and 14 through 17 for horizontal and 
vertical polarizations, respectively. The overall radiation envelopes, 
drawn in Figs. 13 and 18 for these polarizations, respectively, are then 
represented by the largest partial contribution at every point, all the 
rest being ignored. Note that the three contributions (aperture 
illumination taper, top edge, and bottom edge) have been combined into one. 
The PHR antenna radiation envelope can, therefore, be well approximated 
by single functions of the type (6) or (7) in every region, 
characterized by their respective coefficients. The total number of 
constants used by the program for each polarization is 36, only four 
or five of which are used in any individual function. Also, transformations 
using eq. (5) are required for the side edge and side blinder 
contributions. 

Fig. 8—Radiation pattern for the pyramidal horn-reflector antenna, vertical 
polarization. 
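
The "largest partial contribution" rule can be sketched in a few lines. The code below is an illustration only: the coefficient values are hypothetical and are not taken from the paper's lookup table, the function names are invented for this example, and the `floor` argument that guards against log10(0) is an added assumption.

```python
import math

def peak_region(u, v, g_max, k_u, k_v, u0=0.0, v0=0.0):
    """Eq. (6): paraboloid in dB around a local peak at (u0, v0)."""
    return g_max - k_u * (u - u0) ** 2 - k_v * (v - v0) ** 2

def falloff_region(u, v, a_u, a_v, b_u, b_v, floor=1e-6):
    """Eq. (7): logarithmic fall-off in dB; `floor` (an assumption) avoids log10(0)."""
    log_u = math.log10(max(abs(u), floor))
    log_v = math.log10(max(abs(v), floor))
    return a_u + a_v - b_u * log_u - b_v * log_v

def overall_envelope(u, v, contributors):
    """Largest partial contribution at (u, v); all the others are ignored."""
    return max(f(u, v) for f in contributors)

# Hypothetical contributors: a main-beam paraboloid and one side-lobe skirt.
contributors = [
    lambda u, v: peak_region(u, v, g_max=0.0, k_u=4000.0, k_v=4000.0),
    lambda u, v: falloff_region(u, v, a_u=-60.0, a_v=0.0, b_u=20.0, b_v=0.0),
]
near_boresight = overall_envelope(0.01, 0.01, contributors)
far_out = overall_envelope(0.5, 0.01, contributors)
```

Near the boresight the paraboloid dominates, and farther out the logarithmic skirt takes over, so only one function's four or five constants matter at any given point.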


V. SUMMARY 


A procedure was described by which a simple and computationally 
economical mathematical model can be constructed to approximate 
the 3D radiation envelope of directive antennas by making proper use 
of the salient antenna features. Accuracy is a parameter in such a 
model. Test computations of carrier-to-interference ratios executed by 
using this model versus the measured 3D pattern in the terrain 
scattering interference model’ agreed to better than 3 dB in all cases. 

Fig. 9—Partial radiation envelope due to aperture and top and bottom edges, horizontal polarization. 
Fig. 10—Partial radiation envelope due to right edge, horizontal polarization. 
Fig. 11—Partial radiation envelope due to window lobe, horizontal polarization. 
Fig. 12—Partial radiation envelope due to spillover, horizontal polarization. 
Fig. 13—Overall radiation envelope for the pyramidal horn-reflector antenna, horizontal polarization. 
Fig. 14—Partial radiation envelope due to aperture and top and bottom edges, vertical polarization. 
Fig. 15—Partial radiation envelope due to right edge, vertical polarization. 
Fig. 16—Partial radiation envelope due to window lobe, vertical polarization. 
Fig. 17—Partial radiation envelope due to spillover, vertical polarization. 
Fig. 18—Overall radiation envelope for the pyramidal horn-reflector antenna, vertical polarization. 

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This paper describes the design, testing, and measured performance of a 
rate 11/12 self-orthogonal convolutional codec that meets the bit-error-rate 
objective of M-state quadrature amplitude modulation systems in terrestrial 
radio transmission. The objective is to reduce a bit error probability of 10-6 to 
107?° or smaller. In fact, the measured output error probability is well below 
107?° when the channel error probability is below 107°. 


I. INTRODUCTION 


To maintain the low bit error rates—as required for high-quality 
data transmission in the face of increased demand for bandwidth 
efficiency—through the use of M-ary quadrature amplitude modula- 
tion (M-QAM) signaling, application of error-correction coding in 
terrestrial digital radio transmission may be desirable. Accumulated 
“randomly scattered” errors from different error sources may make it 
difficult to meet an objective of very low average background error 
probability, for example, a 107° Bit Error Rate (BER) with a high- 
level modulation such as 64-QAM. 

As will be explained in this paper, convolutional coding was consid- 
ered as a potential candidate for this task. To answer some of the 
implementation questions, and because the code seems very attractive 
for high-speed transmission, we developed a rate 11/12 double error- 
correcting convolutional codec and measured its actual performance 
in terms of codec input/output bit error probability. The code efficiency 
is 0.9166 and is in the range that is justifiable for spectrally 
limited terrestrial radio applications. The intention is to have the 
process of encoding and decoding applied to data rates in the high 
megabit range. As such, there is a need for considerable parallel 
processing in the encoder and decoder realizations. In this paper we 
describe a procedure for such processing for a rate 11/12 convolutional 
code. The parallel processing can be implemented using standard logic 
elements. 

After a general description of convolutional coding, we describe the 
implemented codec in Section I and then present the test setup in 
Section II. Finally, in Section III we present the measured performance 
of the codec. Section IV provides our conclusions. 


1.1 Code structure 


In general, a convolutional encoder can be assumed to be a linear 
sequential network that maps a block of k₀ parallel bits entering the 
circuit into an n₀ > k₀ bit block over a certain period of time. If the 
bits leaving the encoder are the original data bits plus (n₀ − k₀) parity-check 
bits, defined by the particular code polynomials, the code is 
called systematic. The ratio k₀/n₀ is defined as the code rate. In 
convolutional coding, the parity bits in a given block have been 
affected not only by the data bits in the present block but also by 
those in the preceding blocks. (This is not the case for block codes.) 

The constraint length of the code N is the number of bits n₀ in a 
coded block multiplied by the number of blocks m checked by the 
n₀ − k₀ parity checks. For block codes, m = 1. 

In general, convolutional codes have the same limitations and, 
roughly speaking, the same inherent capabilities as block codes. 

Like block codes, convolutional codes are capable of correcting 
random errors, burst errors, and combinations of random and burst 
errors. Since the parity bits in a convolutional code check information 
symbols in the blocks preceding the present block, the basic parity 
matrix is of the form 


$$h = [\,P_{m-1}^T \;\; O \;\; P_{m-2}^T \;\; O \;\; \cdots \;\; P_0^T \;\; I\,],$$


where the (n₀ − k₀) by k₀ matrices P_i^T are arbitrary, and O and I represent, 
respectively, the zero and identity matrices of order n₀ − k₀. The 
encoding process associates (n₀ − k₀) parity checks, as specified by the 
matrix h, to every block of k₀ bits of the entering information, once 
every n₀ channel-bit times. 

Decoding of a convolutional code is possible by algebraic or sequen- 
tial methods. Although both techniques deal with convolutional codes, 
they do so in entirely different manners. As a result, most of the 
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definitions applied in coding are of different forms, depending on the 
way that the code is decoded. 

In algebraic methods, the decoder bases its decisions on the check 
digits within a constraint length, but in the sequential methods the 
decisions are made based on a section of the syndrome that is much 
longer than the code-constraint length. This substantially improves 
error-correcting performance. 

Decoding simply associates an error pattern with the syndrome. 
After the decoder calculates the syndrome, it uses it to decode the 
oldest block in its registers. Like block codes, a convolutional code can 
be modeled in terms of parity-check and generator matrices. 

An n-tuple b is a code word in the convolutional code family described 
by h if and only if bH^T = 0, where H^T is the transpose of the 
parity-check matrix 

$$H = \begin{bmatrix} P_0^T \; I \\ P_1^T \; O \; P_0^T \; I \\ \vdots \\ P_{m-1}^T \; O \; P_{m-2}^T \; O \; \cdots \; P_0^T \; I \end{bmatrix},$$

where P_i^T, O, I were defined in the equation for h. The sequence b can 
be obtained by multiplying the information-bit sequence d = d,-; --- 
dd by the generator matrix of the code, which is 


$$G = \begin{bmatrix} I\,P_0 & O\,P_1 & \cdots & O\,P_{m-1} \\ & I\,P_0 & \cdots & O\,P_{m-2} \\ & & \ddots & \vdots \\ & & & I\,P_0 \end{bmatrix},$$

that is, b = dG. 

It can be seen that GH^T = 0. The syndrome of a received n-tuple r 
is defined as rH^T = s. Clearly, all n-tuples containing errors (not a 
code word) have nonzero syndromes. 

Self-orthogonal codes are a class of convolutional codes that are 
rather simple to implement, and they can also be decoded with 
majority-logic decoding (a nonsequential decoding approach). The 
disadvantage of these codes is that for large values of n₀ and k₀ the 
random-error-correction ability of the code decreases, that is, their 
minimum distance gets small. For ease of implementation only the 
self-orthogonal codes with n₀ − k₀ = 1 are usually considered for 
construction. In self-orthogonal codes with minimum distance d, at 
least d − 1 rows of the parity matrix H are orthogonal (in the coding 
sense) on each of the k₀ bits of the zero block. For these codes, it can 
be shown that 


$$N \ge n_0 \left[ \frac{(n_0 - 1)(d - 1)(d - 2)}{2} + 1 \right],$$


where N is the constraint length and d the minimum distance of the 
code. 

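
As a quick numerical check, the sketch below evaluates the lower bound on the constraint length, assuming it has the form N ≥ n₀[(n₀ − 1)(d − 1)(d − 2)/2 + 1] for codes with n₀ − k₀ = 1. The function name is hypothetical. The values n₀ = 12 and d = 5 correspond to the rate 11/12 double error-correcting code discussed in this paper, whose quoted constraint length is N = 1716.

```python
def min_constraint_length(n0, d):
    """Lower bound n0 * [(n0 - 1)(d - 1)(d - 2)/2 + 1] on the constraint length."""
    # (d - 1)(d - 2) is a product of consecutive integers, so the division is exact.
    return n0 * ((n0 - 1) * (d - 1) * (d - 2) // 2 + 1)

bound = min_constraint_length(n0=12, d=5)   # rate 11/12, t = (d - 1)/2 = 2
meets_bound = 1716 >= bound                 # constraint length quoted for the codec
```

The bound evaluates to 804 bits, so the implemented constraint length of 1716 satisfies it with room to spare.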
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Suppose that a self-orthogonal code has the ability of correcting t = 
(d − 1)/2 random errors. If t or fewer errors occur within a constraint 
length of the (m − 1)th block, it can be shown that it is possible to 
arrange a set of J = d − 1 orthogonal check sums on each data bit in 
the (m − 1)th block of this code. It has been proved that the value 
assumed by the majority of these check sums will always be the correct 
value of the noise digit that was added to the considered data bit.? 
Therefore, by adding this bit to the data bit the correction can be 
made. 

The threshold decoder corrects errors by reencoding the received bit 
stream, adding the regenerated parity bit to the corresponding received 
parity bit, and then forming the syndrome. Then the error pattern 
associated with the syndrome is added to the (m — 1)th block of the 
received sequence. For practical limitations, the m-bit section of the 
(semi-infinite) syndrome that is within a constraint length is stored 
in the syndrome register. As mentioned above, J = d — 1 of the check 
sums represented by the syndrome bits are orthogonal on each data 
bit in the (m — 1)th block. 

Each of the k₀ majority gates functions as a voter for one of the 
orthogonal check-sum sets. Therefore, the syndrome register bits are 
weighted according to the coefficients of the generator polynomials to 
form the orthogonal check-sum sets, and fed to k₀ majority gates 
(threshold circuits). Hence, the number of inputs to each majority 
gate is the same as the number of terms in each generator polynomial, 
that is, J = d − 1. 

The output of the ith threshold circuit, which is a zero when more than 

    J/2,        J even 
    (J − 1)/2,  J odd 

of the inputs are zeros, and one otherwise, is added to the ith data bit 
in the (m − 1)th block in the parity regenerator (reencoder) circuit. 
This correction bit is also used to invert each bit of the syndrome that 
is connected to the ith threshold circuit. After correcting all the errors, 
the bits are shifted out so that the next block can be processed. 

With this introduction we can proceed with the design of the code 

for microwave radios. 


1.2 Design model 


The self-orthogonal convolutional code applied here, as stated ear- 
lier, is a rate 11/12 code. That is, the encoder appends one parity bit 
to the end of each block of 11 information bits to form a 12-bit coded 
block. The code constraint length, that is, the number of bits checked 
by every parity bit, is N = 1716. Therefore, the maximum term in the 
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code generator polynomial is of the order of 142, which is the length 
of the encoder and syndrome shift registers. The code generator 
polynomial set is? 


G=1+D+D°+D' 

Go=1+ D> + D* + D* 
Gz; =1+D°+D* + D® 
G,=1+D"+ D”® + D” 


G; =1+ D” + D*® + D® 
Ge=1+D"+ D+ pD™ 


G7=1+D"+D*%+D™ 
=1+ D*” “ts D}” a Di 
Gj=1+D°+D" + D® 
Gi = 1 + D*" + :D'® + +D)” 
Gi =1+ D® + D*” + Dp. (1) 


We use these polynomials to form the encoder shift register shown in 
Fig. 1 and the syndrome register of the decoder in Fig. 2. The details 
of shift register tap connections for this type of code are shown in Fig. 
3. The blocks in the block diagrams in Figs. 1 through 3 will be described 
in the following paragraphs. 
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Fig. 1—Block diagram of encoder. 
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Fig. 2—Block diagram of decoder. 
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Fig. 3—Tap connection details. 


The encoder circuit shown in Fig. 1 takes a serial data stream at a 
clock rate of R and, via a Serial-to-Parallel (S/P) converter, the 
stream is converted to 11 parallel streams, each clocked at the rate 
R₁ = R/11. To realize the S/P converter circuit, an 11-bit serial-in/ 
parallel-out shift register at the rate R was used. The clock conversion 
is performed by the divide-by-eleven circuit shown in Fig. 1. The 
timing alignment of the data streams is done through a set of flip-flops 
clocked at R₁. The encoder shift register consists of 142 delay 
elements, along with 32 exclusive OR gates placed at locations determined 
by the code generator polynomials. The delay elements were 
realized by using 8-bit Transistor-Transistor Logic (TTL) shift register 
Integrated Circuits (ICs). The generated parity and 11 data bits 
are then combined through a TTL multiplexer addressed by a 12-bit 
counter. The multiplexer has to be clocked at R₂ = (12/11)R or, more 
simply, R₂ = 12R₁. Hence, a clock multiplier was needed to generate 
the 12th harmonic of R₁. A digital phase-lock frequency multiplier was 
designed for this purpose. The circuit is shown in Fig. 4. It consists of 
a phase detector IC, a low-pass loop filter, a voltage-controlled multivibrator, 
and a divide-by-twelve counter. To generate the R₂ clock, a 
harmonic filtering method was tried first and discarded in favor of the 
phase-lock frequency multiplier. As stated earlier, the clock signal 
generated by the phase-lock frequency multiplier is used to address 
the multiplexer (mux) IC and the output of this is reclocked through 
a single flip-flop by the R₂ clock. 
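
The clock relationships in the encoder can be checked with exact arithmetic. The sketch below assumes the parallel clock is R/11 and the channel clock is twelve times the parallel clock, as described above; the function name is hypothetical, and the 10-Mb/s input is the test rate quoted later in the paper.

```python
from fractions import Fraction

def codec_clock_rates(info_rate):
    """Return (parallel clock, channel clock) for an information rate in bit/s."""
    r = Fraction(info_rate)
    r1 = r / 11      # each of the 11 parallel data streams
    r2 = 12 * r1     # 12 coded bits leave per parallel clock period
    return r1, r2

r1, r2 = codec_clock_rates(10_000_000)
channel_rate_mbps = float(r2) / 1e6   # about 10.9 Mb/s on the channel
```

Keeping the rates as fractions makes it explicit that the channel clock is an exact 12/11 multiple of the information clock, which is why a frequency multiplier locked to the parallel clock is needed rather than a simple divider.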

A block diagram of the decoder is shown in Fig. 2. The received 
coded data at rate R₂ = (12/11)R is passed through an S/P converter 
to obtain 12 parallel bit streams. To alleviate the encoder/decoder 
synchronization problem in the experimental model, the R₂ clock 
generated on the encoder board was hard-wired to the decoder front-end 
circuit, where a divide-by-twelve counter was used to generate 
R₁ = R₂/12. This clock, in turn, is used to time align the 12 data 
streams through a set of flip-flops. The reencoder circuit is identical 
to the encoder shift register shown in Fig. 1. The generated parity and 
received parity bits are added through an exclusive OR gate to form 
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Fig. 4—Phase-lock frequency multiplier. 
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the syndrome bits. The syndrome register is set up the same way as 
the reencoder shift register. The syndrome register outputs are fed to 
a set of 11 threshold circuits. Each threshold circuit, as explained 
earlier, performs a majority logic vote; that is, if more than half the 
inputs to each circuit are binary ones, the output of that circuit will 
be a one, otherwise it will output a zero. Each binary one at the 
threshold circuit output indicates an error on a particular data line 
being checked by that threshold circuit. Because the code implemented 
here is a double error-correcting code, there are four inputs to each 
threshold circuit that operate on the orthogonal check bit set. 
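
The majority vote and syndrome feedback described above can be sketched on a much smaller code. The example below is an illustration only: it uses a toy rate-1/2 self-orthogonal code with the single generator 1 + D^2 + D^5 + D^6 (an assumption, chosen because all differences between its tap positions are distinct, which makes its four check sums orthogonal), not the rate 11/12 polynomials of the implemented codec. All function names are hypothetical.

```python
TAPS = (0, 2, 5, 6)   # exponents of the toy generator 1 + D^2 + D^5 + D^6

def parity_stream(data):
    """Parity bit at time t is the XOR of data[t - k] over the taps k."""
    out = []
    for t in range(len(data)):
        p = 0
        for k in TAPS:
            if t - k >= 0:
                p ^= data[t - k]
        out.append(p)
    return out

def threshold_decode(rx_data, rx_parity):
    """Majority-logic decoding with feedback into the syndrome."""
    # Syndrome: regenerated parity added (mod 2) to the received parity.
    syndrome = [a ^ b for a, b in zip(parity_stream(rx_data), rx_parity)]
    decoded = list(rx_data)
    for t in range(len(rx_data) - max(TAPS)):
        checks = [t + k for k in TAPS]        # J = 4 check sums orthogonal on bit t
        votes = sum(syndrome[i] for i in checks)
        if votes > len(TAPS) // 2:            # more than half of the inputs are ones
            decoded[t] ^= 1                   # correct the data bit
            for i in checks:                  # remove its effect from the syndrome
                syndrome[i] ^= 1
    return decoded

message = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 1, 0]
tail = [0] * max(TAPS)        # flush bits so every message bit gets all four checks
sent_data = message + tail
sent_parity = parity_stream(sent_data)

rx_data, rx_parity = list(sent_data), list(sent_parity)
rx_data[3] ^= 1               # two channel errors within one constraint length
rx_data[5] ^= 1
recovered = threshold_decode(rx_data, rx_parity)[:len(message)]
```

With four check sums per bit the toy code, like the implemented one, corrects up to two errors per constraint length; the feedback step matters here because the first corrected error would otherwise leave stale ones in the syndrome when the second error is voted on.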

To have the error correction correctly performed, the 12 data lines 
have to be delayed by one syndrome register length. This can be done 
by using two RAMs in parallel; however, in this experimental model 
we used a set of shift register ICs, each containing a 128-bit delay and 
clocked at R₁ to acquire the delay needed. The error indicators are 
then modulo-2 added to the proper data bits, and the corrected data 
lines are multiplexed by a similar approach, as explained in the encoder 
circuit description. Again, a phase-lock frequency multiplier is used to 
provide the information clock rate at the decoder output. The multi- 
plexer output is reclocked through a single flip-flop at the clock rate 
R. In addition, the error indicators are used to remove the effect of 
corrected errors from the check bits entering the syndrome register. 

The fact that the main encoding/decoding operations here are done 
at a relatively low speed, because of the input serial-to-parallel con- 
versions, makes this type of codec attractive for high-speed data 
transmission. Next we discuss the test procedure. 


II. TESTING 


To check the performance of the encoder and decoder, a test drawer 
was designed and built. The test drawer consisted of the encoder/ 
decoder circuits, a thermal noise source, a summing amplifier, two 
attenuators, and a switch, as shown in Fig. 5. Functionally, this test 
drawer was to measure the error rate at the input/output of the codec. 

The BER test set used here produces a random bit stream, repre- 
senting the information bits, which is then encoded through the 
encoder. Noise is then added to the encoded signal before it enters the 
decoder. In order to measure both the input and output error proba- 
bilities with the same BER test set, we used the following method. 
The syndrome register flip-flops are set during the normal course of 
error correction and the decoder output stream closely resembles the 
encoder input data stream. However, if we reset the decoder syndrome 
register, the decoder error-correcting function is blocked. Consequently, 
the decoder outputs the unmodified noisy bit stream. Therefore, 
the BER test set then measures the input error probability. The 
syndrome register resetting operation was done by the switch shown 
in Fig. 5. The noise generator was a standard thermal noise source. 
The step attenuators varied the signal and noise power in order 
to produce different error rates at the input. 

Fig. 5. Block diagram of test setup. 

The test was performed at an input rate of 10 Mb/s. However, 
because of the serial-to-parallel operation at the encoder/decoder 
input, up to a 250-Mb/s data rate can be handled by this codec, using 
standard TTL integrated circuits. 


III. CALCULATED AND MEASURED RESULTS 


An approximate expression for the performance of self-orthogonal 
convolutional codes at low channel error rates is presented in 
Ref. 2, and more details can be found in Ref. 3. The result is an 
asymptotic upper bound on the bit error probability of the decoded 
bit. The bound, eq. (2), is built from binomial terms of the form 

    \binom{N}{i} p^{i} (1 - p)^{N-i},                               (2) 

and holds asymptotically (in N_0), where 

    N_0 = white noise spectral height 
    N   = constraint length = 1716 
    p   = input bit error probability. 

The other quantities entering the bound are the bit error probability 
after decoding, the code rate (11/12), and the number of bit errors 
corrected per constraint length (2). 

The expression in (2) for the bit error probability after decoding is for any particular decoded bit in the 
first group in a constraint length, under the assumption either that 
(1) decoding is direct (without feedback syndrome correction) and the 
immediately preceding constraint span was free of decoder input 
errors, or (2) decoding is with feedback and the immediately preceding 
constraint span was free of decoder output errors. It happens to be 
valid, in general, only by virtue of the fact that the effects of prior 
history of the decoder are outweighed by the excess probability— 
included in eq. (2)—of those triple or higher weight input error 
patterns that do not cause output errors. This bound for the code 
implemented here is shown as one of the curves in Fig. 6. 
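
As a rough numerical companion to the bound, the sketch below evaluates the binomial tail probability that more than t of the N bits in a constraint length are in error, using N = 1716 and t = 2 from the text. This is an illustrative assumption rather than expression (2) itself: any normalizing factor in (2) is omitted, errors are taken to be independent, and the function name `binomial_tail` is hypothetical.

```python
import math


def binomial_tail(n: int, t: int, p: float) -> float:
    """Probability of more than t errors among n independent bits.

    Each bit is assumed to be in error independently with probability p.
    Terms are formed in the log domain so that large binomial
    coefficients do not overflow.
    """
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    log_p = math.log(p)
    log_q = math.log1p(-p)
    total = 0.0
    for i in range(t + 1, n + 1):
        # log of C(n, i) * p**i * (1 - p)**(n - i)
        log_term = (math.lgamma(n + 1) - math.lgamma(i + 1)
                    - math.lgamma(n - i + 1) + i * log_p + (n - i) * log_q)
        total += math.exp(log_term)
    return total


if __name__ == "__main__":
    # Constraint length and correction capability quoted in the text.
    for p in (1e-4, 1e-5, 1e-6):
        print(f"p = {p:.0e}: tail = {binomial_tail(1716, 2, p):.3e}")
```

For small p the tail is dominated by the triple-error term, which is why it falls roughly as the cube of the input error probability.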

Fig. 6. Performance of a rate 11/12 codec. 

As stated earlier, the set of error indicators can be used as feedback 
to clean up the syndrome register. To investigate how much improvement 
is achieved by this operation, the output error probability measurements 
were taken for two cases: with and without syndrome error 
correction. These results are also shown in Fig. 6. As one can observe, 
having the set of feedback error indicators connected improves per- 
formance. In this case, the output probability of error is improved by 
almost one half of an order of magnitude at an input error rate of 10~¢ 
by having the feedback links in Fig. 2 connected. 

As stated earlier, the difference between the approximate bound and 
the measured performance of this double-error-correcting code can be 
attributed to the fact that a self-orthogonal, double-error-correcting 
convolutional codec can also correct many error patterns of weight 
three, four, and higher, whereas the bound in eq. (2) takes only 
double-error correction into account. 

Note that the BER test setup simulated only thermal noise effects. 
The possible effects of modem implementation and other nonthermal 
effects on a real channel need to be characterized. If these effects 
merely increase the decoder input error rate for a given ratio of bit energy to 
noise density (E_b/N_0), but maintain a pure Poisson distribution of 
those errors, then the output BER versus input BER results of the 
decoder test will still apply. Conversely, if the other effects cause 
significant departure from a Poisson arrival of decoder input errors, 
then the output BER performance versus input BER performance of 
the decoder will degrade from the test results previously described. In 
particular, if there is a tendency towards error clustering, a degradation 
could occur. For instance, clusters of three errors or more in a con- 
straint span that are more frequent than that predicted by a Poisson 
model could be a source for performance degradation. 

For example, in a Gray-coded 16-QAM modem, even a very small 
residual phase offset error in detection would significantly increase 
the probability of 2 bit errors in a 4-bit baud. Then both errors are 
prone to erroneous decoding if a third input error happens to occur 
nearby. When the objective is a 10^{-10} output BER, even a very slight 
effect of this sort can quickly result in an order-of-magnitude 
degradation in the codec performance. 


IV. CONCLUSIONS 


This paper has described the design, testing, and performance of a 
rate 11/12 self-orthogonal convolutional codec that meets the BER 
performance objective of M-QAM radio systems. The objective was to 
convert a bit error rate of 10~° to an equivalent error rate of 10~*° or 
better. The measured error rate is well below 107° at an input error 
rate of 107°. 


Much has been learned in recent years about the existence, determination, 
and properties of power-series-like expansions for expressing a nonlinear 
system’s outputs in terms of its inputs. In particular, the existence and local 
convergence of expansions, and of certain “associated expansions,” for impor- 
tant large classes of systems are now well established. While the focus of 
attention has been on questions such that the size of the inputs for which 
convergence is guaranteed is not the main issue, some related material has 
appeared that bears on the problem of determining the extent of the region of 
convergence. The result most closely related to this paper is a recent theorem 
that gives necessary and sufficient conditions under which f^{-1} has a generalized 
power-series expansion when f is an invertible locally Lipschitz map between 
certain general subsets of two complex Banach spaces. In applications involv- 
ing nonlinear models, ordinarily only real spaces of inputs and outputs are of 
direct interest. A “complexification” involving a certain solvability condition 
in complex spaces has to be able to be carried out to use the theorem referred 
to above. This paper reports on pertinent general results concerning invertible 
maps between subsets of real Banach spaces, with their complex extensions, 
and with generalized power-series expansions in both real and complex spaces. 
It focuses on questions concerning expansions for inverses of maps defined in 
real spaces. The results show that for a very large class of systems that have 
input/output maps, the ability to complexify is not just a useful sufficient 
condition for expandability, but is in fact the key condition for an input/ 
output map to be representable by a generalized power-series expansion. 


I. INTRODUCTION 


Convolution operator input/output representations for linear sys- 
tems are well understood and are widely used. With regard to corre- 
sponding representations for nonlinear systems, much has been 
learned in recent years about the existence, determination, and prop- 
erties of power-series-like expansions for expressing a system’s outputs 
in terms of its inputs (see, for example, Refs. 1-7). In particular, the 
existence and local convergence of expansions, and of certain “associated 
expansions” (Ref. 8), are now well established for important large 
classes of systems. 

While the focus of attention in Refs. 1 through 6 has been on 
questions such that the size of the inputs for which convergence is 
guaranteed is not the main issue, some related material has appeared 
that bears on the problem of determining the extent of the region of 
convergence. The result most closely related to this paper is a theorem 
in Ref. 7 that gives necessary and sufficient conditions under which 
f^{-1} has a generalized power-series expansion (in the sense of our 
Section 2.1) when f is an invertible locally Lipschitz map between 
certain general subsets of two complex Banach spaces. Another theo- 
rem in Ref. 7 yields an algorithm for obtaining the expansion whenever 
it exists, and these two theorems are used therein to prove results 
concerning a certain system model considered in Ref. 2 and in earlier 
papers. 

In applications involving nonlinear models, ordinarily only real 
spaces of inputs, outputs, and intermediate signals are of direct inter- 
est. A “complexification” involving the existence of a certain inverse 
map defined on a complex space has to be able to be carried out to use 
the theorems in Ref. 7. One of the main applications of the results in 
this paper is a proof that in an important general setting this com- 
plexification condition is always met when certain invertibility and 
expandability conditions are satisfied in the underlying real space. As 
a consequence, for a very large class of systems that have input/output 
maps, the ability to complexify emerges as the key condition for an 
input/output map to be representable by a generalized power-series 
expansion. (Under certain reasonable assumptions these expansions 
reduce to Volterra-like series.)”® . 

To be more explicit, models of the kind mentioned above are 
characterized by five operators: a nonlinear operator N, and four linear 
operators a, b, c, and d. They have an input v and an output w, which 
belong to a space X of functions. Here X is taken to be a real Banach 
space; a, b, and c are assumed to be bounded maps of X into X, and 
we suppose that N is defined on all of X and takes X into X. One has 
w = dv + bN(I - cN)^{-1}av (I the identity map on X) subject to some 
natural qualifications, from which it is clear that the study of such 
models* often involves the study of maps of the form (I - cN). The X 
of particular interest to us is the space of real Lebesgue-measurable, 
n-vector-valued functions x defined on [0, ∞), with the norm in X 
given by ||x|| = max_j sup_{t≥0} |x_j(t)|, where x_j(t) is the jth component 
of x(t). 

Let A_0 and A be subsets of X such that (I - cN) restricted to A_0 is 
an invertible map of A_0 onto A. Assume that both A_0 and A are open 
sets, and that A contains the zero element of X. Under these conditions, 
w is well defined for each v such that av ∈ A, the zero function 
is an allowed input, and the set of allowed inputs is open. The question 
that we ask is this: With (I - cN)^{-1} the inverse of the restriction of 
(I - cN) to A_0, assumed to be continuous, when is it true that 
(I - cN)^{-1}u has a generalized power-series expansion that converges 
for u ∈ A? When it is true, the map from v to w has an expansion 
that converges whenever av ∈ A, assuming (and this is frequently very 
reasonable) that N is such that the existence of the expansion for 
(I - cN)^{-1}u implies the existence of an expansion for N(I - cN)^{-1}u; 
see, for example, Corollary 1 in Appendix A or Theorems 1 and 7. 

Theorem 2 in Section II provides an answer to the question, under 
the assumption that N has an extension into a complex 
space ℬ associated with X, with this extension a certain type of 
globally convergent generalized power series. For the X of particular 
interest, this assumption is a reasonable one, and the corresponding 
ℬ turns out to be just the natural complex associate of X. The answer 
given by Theorem 2 is that there must be two open subsets V_0 and V 
of ℬ such that: A ⊂ V (meaning that u + i0 ∈ V for each u ∈ A; see 
Section 2.1), A_0 ⊂ V_0, V is a “star” in the sense that zg ∈ V when g ∈ 
V and z is a scalar such that |z| ≤ 1, and the map (I - cN) extended 
into ℬ (see Section 2.2), and restricted to V_0, must be a homeomorphism 
of V_0 onto V, with the inverse of the restriction of the extended 
(I - cN) locally Lipschitz on V. While this necessary and sufficient 
condition† may look complicated at first glance, its interpretation is 
straightforward: (I - cN)^{-1}u has a power-series expansion that converges 
for u ∈ A if and only if the equation x - cNx = u, when 
extended into the complex space ℬ, is, so to speak, uniquely locally 
Lipschitz solvable in some open subset of ℬ containing the points of 
A_0 for all right sides belonging to V, where V is any open star in ℬ 
that contains the elements of A. (See Section 2.3.1. In this connection, 
notice that an open ball centered at the origin is an example of a star.) 

* There is an error in the corresponding equation in Ref. 7, where B in (11) should 
be replaced with BN. This does not change the conclusion drawn there from Theorem 
3; see, for example, our Theorem 7. 

† The sufficiency of this type of condition is discussed in Ref. 7, Section 2.4.2. Also, 
with regard to the system model in Ref. 7 (p. 84), note that the existence of an expansion 
for w (in terms of v) implies the existence of an expansion for y and thus for x if, for 
example, B is the identity operator. 

As is suggested by the application described above, the results in 
this paper are concerned with invertible maps between subsets of real 
Banach spaces, with their complex extensions, and with generalized 
power-series expansions in both real and complex spaces, with the 
focus on questions concerning global expansions for inverses of maps 
defined in real spaces. Preliminaries are introduced in Section 2.1, and 
Sections 2.2 through 2.6 contain the paper’s principal results. 

There are several natural applications of the material in Section II 
other than the one already discussed. For example, consider again the 
five-operator model described above, and assume that the assumptions 
introduced are met. Assume in addition that an expansion represen- 
tation for (I — cN)~‘u does exist for u € A. Suppose that this expansion 
also converges for u € B, where B is some open subset of X for which 
A CB. Theorem 3 shows that then the map from v to w is in fact both 
well defined and has a generalized power-series expansion for au € B. 

It will become clear that the theorems in Section II are considerably 
more general than the applications discussed above are able to illus- 
trate. For instance, they bear on cases in which the underlying function 
space is a set of functions of more than one independent variable. 
Also, in Section 2.5 corresponding results are given for certain implic- 
itly defined maps. These latter results are useful in, for example, 
studies of globally convergent generalized power-series expansions for 
solutions of differential equations. 


II. COMPLEX EXTENSIONS AND EXPANSION REPRESENTATIONS 

2.1 Preliminaries 


Throughout the paper X denotes a real Banach space. We associate 
with X (see Ref. 9, p. 312 and Ref. 10, p. 665) a complex Banach space 
ℬ defined as follows: the elements of ℬ are ordered pairs (x_1, x_2) of 
elements of X, addition and multiplication obey 

    (x_1, x_2) + (y_1, y_2) = (x_1 + y_1, x_2 + y_2) 
    (\alpha + i\beta)(x_1, x_2) = (\alpha x_1 - \beta x_2, \alpha x_2 + \beta x_1), 

and the norm of an element of ℬ is given by 

    \|(x_1, x_2)\| = \sup_{\|\xi\| = 1} [\xi^2(x_1) + \xi^2(x_2)]^{1/2}, 

where ξ denotes a general, real, bounded linear functional on X. We 
sometimes use x_1 + ix_2 to denote an element (x_1, x_2) of ℬ. 
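
As a quick check of this definition (a worked special case added for illustration, not part of the original text), take X to be the real line with the absolute-value norm. Every real bounded linear functional then has the form ξ(x) = cx with norm |c|, so, taking the supremum over functionals of unit norm as in the Hahn-Banach footnote, one would expect:

```latex
\|(x_1, x_2)\|
  = \sup_{|c| = 1} \left[ c^2 x_1^2 + c^2 x_2^2 \right]^{1/2}
  = \left( x_1^2 + x_2^2 \right)^{1/2}
  = |x_1 + i x_2| .
```

The norm reduces to the ordinary modulus, which agrees with the example in Section 2.3.1, where the complex extension of the reals is the usual space of complex numbers.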

The map x → x + i0 of elements of X into elements of ℬ isometrically* 
imbeds X into the complex space ℬ. In particular, in this sense, 
ℬ is a complex extension of X. For example, if X is the space of 
bounded, Lebesgue-measurable, real n-vector-valued functions x defined 
on [0, ∞), with ||x|| = max_j sup_t |x_j(t)|, then the elements v of 
ℬ are bounded, Lebesgue-measurable, complex n-vector-valued functions 
defined on [0, ∞), and (see Appendix B) one simply has ||v|| = 
max_j sup_t |v_j(t)|. 

A star in ℬ means a subset S of ℬ such that zv ∈ S for v ∈ S and 
any complex scalar z with |z| ≤ 1. A subset S of ℬ is c-convex if for 
any bounded open set Δ of complex numbers, we have (v + Δw) ⊂ S 
whenever (v + Γw) ⊂ S, where Γ is the boundary of Δ. 

In the paper, X_0 denotes a second real Banach space and ℬ_0 stands 
for its complex extension. We allow the possibility that X_0 = X. 

Now let Y and W be any two Banach spaces, both real or both 
complex. 

Given any positive integer m, by an m-linear map q from Y^m 
into W we mean that q(y_1, ..., y_m) is linear (i.e., additive and 
homogeneous) separately in each y_j. Such a map is symmetric if 
q(y_1, ..., y_m) is symmetric in the variables y_1, ..., y_m. A map h from 
Y into W is called a homogeneous polynomial of degree m if there 
exists an m-linear q from Y^m to W such that h(y) = q(y, ..., y) for 
all y.† A homogeneous polynomial of degree zero is a constant map. 

For S a subset of Y, let 𝒫(S, W) denote the set of all maps p from 
S into W such that there are homogeneous polynomials h_m of degree 
m (m = 0, 1, ...) from Y to W, with the properties that \sum_{m=0}^{\infty} h_m(s) 
converges in W for each s ∈ S, and 

    p(s) = \sum_{m=0}^{\infty} h_m(s),   s \in S.                      (1) 

The set 𝒫(S, W) is, of course, a set of maps p that admit a generalized 
power-series expansion in the sense indicated. If S contains an open 
ball in Y centered at the origin, then the expansion (1) for any p ∈ 
𝒫(S, W) is unique in the sense that if 

    p(s) = \sum_{m=0}^{\infty} g_m(s),   s \in S, 

with each g_m a homogeneous polynomial of degree m, then g_m = h_m for 
all m (see Ref. 11, p. 174 and Ref. 1, Section 2.7). 
Finally, we say that p belongs to 𝒫_r(S, W) if p ∈ 𝒫(S, W) and for 
each positive m there is a continuous symmetric m-linear q_m from Y^m 
into W such that h_m(s) = q_m(s, ..., s) for all s. In particular, then 
each h_m is bounded in the sense that there is a constant ρ_m > 0 such 
that ||h_m(s)|| ≤ ρ_m ||s||^m for all m and s, and every h_m is Fréchet 
differentiable on Y. 

* By the Hahn-Banach theorem, ||x|| = sup{ξ(x): ||ξ|| = 1}. 

† The same class of maps is obtained if “m-linear” is replaced with 
“symmetric m-linear.” 


2.2 Inverse maps and necessary conditions for the existence of series 
representations 

Throughout this section, and in Sections 2.3, 2.4, and 2.6, f is a map 
from X_0 into X, A and A_0 are open subsets of X and X_0, respectively, 
with 0 ∈ A, f restricted to A_0 is a homeomorphism of A_0 onto A, and 
g: A → A_0 is the inverse of the restriction of f. It is assumed that there 
is an f* ∈ 𝒫_r(ℬ_0, ℬ) such that f(x) = f*(x + i0) for x ∈ X_0. [Of 
course, by f(x) = f*(x + i0) we mean that f(x) + i0 = f*(x + i0).] 

The following extension theorem is this paper’s main result. 

Theorem 1: If g has a power-series representation in the sense that g ∈ 
𝒫(A, X_0), then there are open sets V and V_0 in ℬ and ℬ_0, respectively, 
together with a map g*: V → V_0 such that V is a c-convex star, A ⊂ V, 
A_0 ⊂ V_0, and 

1. the restriction of f* to V_0 is a homeomorphism of V_0 onto V with 
   inverse g* 
2. g* ∈ 𝒫_r(V, ℬ_0) 
3. g(x) = g*(x + i0), x ∈ A. 


2.2.1 Proof of Theorem 1 


Two lemmas are used in the proof. The first of these follows. 


Lemma 1: Let D be an open subset of X with 0 ∈ D, let h ∈ 𝒫(D, X_0), 
and assume that h is continuous on D. Then there are an open c-convex 
star Z ⊂ ℬ and a map h* from Z to ℬ_0 such that D ⊂ Z, h* ∈ 
𝒫_r(Z, ℬ_0), and h(x) = h*(x + i0) for x ∈ D. 


Proof of Lemma 1: We have 

    h(s) = \sum_{m=0}^{\infty} h_m(s),   s \in D,                      (2) 

where each h_m is a homogeneous polynomial of degree m. Let q_m be 
the unique symmetric m-linear map such that h_m(s) = q_m(s, ..., s) 
for s ∈ X and positive m (see Ref. 12, pp. 762-3). Let h*_0 = h_0 + i0, 
and define h*_m: ℬ → ℬ_0 for each m ≥ 1 by 

    h^*_m(x_1 + i x_2) = \sum_{k=0}^{m} \binom{m}{k} i^{m-k} q_m(x_1, \ldots, x_1, x_2, \ldots, x_2),   (3) 

with k x_1’s and (m - k) x_2’s on the right side. It is not difficult to verify 
that the h*_m are homogeneous polynomials of degree m (see Ref. 9, p. 
313 and Ref. 13, p. 71). By Theorem 5.7 of Ref. 13 there is an open 
subset Z_1 of ℬ such that D ⊂ Z_1 and the series 

    \sum_{m=0}^{\infty} h^*_m(v)                                       (4) 

converges for v ∈ Z_1. Since D is open and h is continuous, it follows 
(see Ref. 13, Theorems 4.4 and 6.6) that h is analytic in D in the sense 
of Ref. 13, p. 75. Thus, using the hypothesis that 0 ∈ D, h has a power-series 
expansion valid in a neighborhood of the origin, with the terms 
in the expansion continuous homogeneous polynomials. At this point 
the uniqueness result mentioned in Section 2.1 shows that the h_m in 
(2) are continuous. 
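
To make the complexified polynomials of this proof concrete, here is the case m = 2, worked out as an illustration (it is not part of the original argument). Treating the symmetric bilinear map q_2 as bilinear over complex scalars and expanding q_2(x_1 + ix_2, x_1 + ix_2) gives

```latex
h^*_2(x_1 + i x_2) = q_2(x_1, x_1) + 2i\, q_2(x_1, x_2) - q_2(x_2, x_2) .
```

The real part is q_2(x_1, x_1) - q_2(x_2, x_2), the imaginary part is 2q_2(x_1, x_2), and setting x_2 = 0 recovers h_2(x_1), as the extension requires.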

By the continuity of the h_m, the q_m in (3) are continuous. Using (3) 
and the fact that ||x_1|| < 1 and ||x_2|| < 1 are implied by ||x_1 + ix_2|| < 
1 [see (7), below], we see that each h*_m is bounded in the ball 
||x_1 + ix_2|| < 1. This shows (see Ref. 12, Theorem 26.2.4) that the h*_m 
are continuous. 

Let Z denote the interior of the region of convergence of the series 
(4), and let h*(v) be the sum (4) for any v ∈ Z. Obviously Z_1 ⊂ Z. 
Since the h*_m are continuous, it follows (see Ref. 12, Theorem 26.6.1) 
that Z is a c-convex star. Since it is clear that h*_m(x + i0) = h_m(x) for 
x ∈ X, the proof of the lemma is complete. 

Continuing with the proof of the theorem, by Lemma 1 there are an 
open c-convex star V ⊂ ℬ and a map g*: V → ℬ_0 such that A ⊂ V, g* 
∈ 𝒫_r(V, ℬ_0), and g(x) = g*(x + i0) for x ∈ A. 

We now turn to our second lemma. 

Lemma 2: Let h be a Fréchet-differentiable map (Ref. 14, p. 149) from 
an open connected subset D of ℬ into ℬ_0. Assume that there is a point 
p in D and an open ball Q in X centered at the origin such that 
(p + Q) ≜ {s ∈ ℬ: s = p + (q + i0), q ∈ Q} ⊂ D and h maps (p + Q) 
into the origin in ℬ_0. Then h vanishes everywhere in D. 


Proof of Lemma 2: Since h is Fréchet differentiable in a neighborhood 
of p, there are (see Ref. 12, Theorems 3.16.2 and 26.3.5) homogeneous 
polynomials H_m of degree m (m = 1, 2, ...) and a σ > 0 such that s ∈ 
D and 

    h(s) = \sum_{m=1}^{\infty} H_m(s - p) 

when ||s - p|| < σ. Thus, for some positive ρ < σ, we have 

    \sum_{m=1}^{\infty} H_m(q + i0) = \theta 

for q ∈ {x ∈ X: ||x|| < ρ}, where θ is the zero element of ℬ_0. It easily 
follows (see Ref. 1, Section 2.7) that H_m(x + i0) = θ for each m and 
all x ∈ X. 

REPRESENTATION THEORY 1645 

Consider H_m(x_1 + ix_2) with m, x_1, and x_2 arbitrary, and let P_m denote 
the polar form (see Ref. 12, pp. 762-3) associated with H_m. We see 
that H_m(x_1 + ix_2) can be written as a finite sum of terms of the form 
cP_m(y_1 + i0, ..., y_m + i0), in which c ∈ {±1, ±i} and each y_j is either 
x_1 or x_2. On the other hand, P_m(y_1 + i0, ..., y_m + i0) can be expressed 
(see Ref. 9, p. 306) as 

    \frac{1}{m!} \sum_{\epsilon_1, \ldots, \epsilon_m = 0}^{1} (-1)^{m - (\epsilon_1 + \cdots + \epsilon_m)} H_m[\epsilon_1 (y_1 + i0) + \cdots + \epsilon_m (y_m + i0)]. 

Therefore, using H_m(x + i0) = θ for x ∈ X, one has H_m(x_1 + ix_2) = θ. 
This shows that h(s) = θ for s in an open ball in D. Since D is 
connected and h is G-differentiable in the sense of Ref. 12 (pp. 109-10), 
it follows (see Ref. 12, Theorem 3.16.4) that h(s) = θ throughout 
D, as claimed. We now return to the proof of the theorem. 

Let E(v) denote f*[g*(v)] - v (v ∈ V). Since f* ∈ 𝒫_r(ℬ_0, ℬ) and 
g* ∈ 𝒫_r(V, ℬ_0), f* and g* are Fréchet differentiable on ℬ_0 and V, 
respectively (see Ref. 12, Theorems 26.6.4 and 3.17.1). Thus, using a 
version of the chain rule for differentiating a composite map, E is 
Fréchet differentiable on V. In addition, the set V is connected 
(because it is a star), and we have 

    E(x + i0) = f*[g*(x + i0)] - (x + i0) = θ,   x ∈ A. 

Choose any point p_1 ∈ A, and let Q be an open ball in X centered at 
the origin such that (p_1 + Q) ⊂ A. Let p = (p_1 + i0), and observe that 
E[p + (q + i0)] = θ for q ∈ Q. By Lemma 2 (with ℬ_0 = ℬ), E(v) = θ 
(the zero element of ℬ) for v ∈ V. This gives 

    f*[g*(v)] = v,   v ∈ V.                                          (5) 

Since V is connected and g* is continuous on V, g*(V) is connected. 
The continuity of f* implies that f*^{-1}(V) is open. From (5) it is clear 
that g*(V) ⊂ f*^{-1}(V). Let V_0 denote the maximal connected subset 
(i.e., the component) of f*^{-1}(V) that contains g*(V). Since f*^{-1}(V) is 
open, so is V_0. The map f* obviously takes V_0 into V. 

Now let F(w) denote g*[f*(w)] - w (w ∈ V_0). It follows from 
Lemma 2 and F(x + i0) = θ for x ∈ A_0 that 

    g*[f*(w)] = w,   w ∈ V_0.                                        (6) 

Since (5) and (6) hold, f* restricted to V_0 is a homeomorphism of V_0 
onto V. The observation that A_0 ⊂ V_0 because A_0 = g(A) ⊂ g*(V) = 
V_0 completes the proof of the theorem. 


2.3 Complex-solvability criteria for global expandability 


Here we use Theorem 1 to obtain necessary and sufficient conditions 
for the global expandability of the map g introduced at the beginning 
of Section 2.2. With regard to our result, Theorem 2 (below), we say 
that a map h from an open subset D of ℬ into ℬ_0 is locally Lipschitz 
on D if for each a ∈ D there are a positive number c_a and an open ball 
β_a ⊂ D centered at a such that ||h(v_1) - h(v_2)|| ≤ c_a ||v_1 - v_2|| for v_1 
and v_2 in β_a. 

Theorem 2: We have g ∈ 𝒫(A, X_0) if and only if (i) there are open sets 
V and V_0 in ℬ and ℬ_0, respectively, with V a star, A ⊂ V, and A_0 ⊂ 
V_0 such that the restriction of f* to V_0 is a homeomorphism of V_0 onto 
V, with the inverse of the restriction of f* to V_0 locally Lipschitz on V, 
and (ii) the spaces ℬ and ℬ_0 are homeomorphic, in the sense that there 
is a linear homeomorphism of ℬ onto ℬ_0. 

Proof: The necessity of (i) follows from Theorem 1 and the observation 
that g* in Theorem 1, which belongs to 𝒫_r(V, ℬ_0), is Fréchet 
differentiable, hence continuously Fréchet differentiable (see Ref. 7, Lemma 
2), and thus locally Lipschitz. Similarly, the necessity of (ii) is a 
consequence of Theorem 1 and the fact that the conclusion of Theorem 
1 implies (see Ref. 15, p. 175, Problem 6) that the F-derivative (i.e., 
the Fréchet derivative) of f* at any point in V_0 is an invertible map of 
ℬ_0 onto ℬ. 

On the other hand, if (i) and (ii) are met, then, using Lemma 1 of 
Ref. 7, the inverse H of the restriction of f* to V_0 is F-differentiable 
[and thus G-differentiable (see Ref. 12, pp. 109-10)] on V. It follows 
that H ∈ 𝒫(V, ℬ_0) (Ref. 12, Theorems 3.16.2 and 26.3.4). Let the 
series for H be given by 

    H(v) = \sum_{m=0}^{\infty} H_m(v),   v \in V. 

By the conditions on f, for any x ∈ A there is a y ∈ A_0 such that 
f*(y + i0) = (x + i0). Therefore, using H[f*(v)] = v (v ∈ V_0) and 
A_0 ⊂ V_0, we see that H takes A into A_0. Thus, since f*[H(v)] = v 
(v ∈ V), we have 

    g(x) = \sum_{m=0}^{\infty} H_m(x + i0),   x \in A. 

Of course H_0(0 + i0) ∈ X_0. We claim that for each positive m there 
is an m-linear map Q_m from X^m into X_0 such that Q_m(x, ..., x) = 
H_m(x + i0), x ∈ A. Since the H_m are homogeneous polynomials, we 
need only show that each H_m maps X into X_0, and we do that as 
follows. 

The norm in ℬ has the property that ||x_1 + ix_2|| < δ implies that 
||x_2|| < δ, because, using the Hahn-Banach theorem, 

    \|x_2\| = \sup_{\|\xi\| = 1} |\xi(x_2)| \le \sup_{\|\xi\| = 1} [\xi(x_1)^2 + \xi(x_2)^2]^{1/2} = \|x_1 + i x_2\|.   (7) 

Thus, the convergence of \sum_{m=0}^{\infty} H_m(x + i0) (x ∈ A) implies that for 
each x ∈ A, the series \sum_{m=0}^{\infty} K H_m(x + i0) converges in X_0, and that it 
converges to the zero element, where K is the map from ℬ_0 to X_0 
defined by x_2 = K(x_1 + ix_2). In particular, with β any open ball in A 
centered at the origin, one has rx ∈ β and 

    0 = \sum_{m=0}^{\infty} K H_m(rx + i0) = \sum_{m=0}^{\infty} r^m K H_m(x + i0) 

for x ∈ β and |r| < 1. It follows (see Ref. 11, proof of Theorem 6) that 
K H_m(x + i0) = 0 for each m and any x ∈ X, showing that the H_m map 
X into X_0. This completes the proof. 


2.3.1 Comments 


Since the inverse image of an open set under a continuous map is 
open, we see that (i) is equivalent to the condition that there be an 
open subset S of ℬ_0 and an open star V ⊂ ℬ with the following 
properties: A_0 ⊂ S, A ⊂ V, for each v ∈ V there is a unique w ∈ S that 
satisfies f*(w) = v, and the map v → w is locally Lipschitz. This more 
sharply focuses attention on how machinery for proving existence, 
such as fixed-point techniques, might be used to establish expandability. 
A pertinent example can be found in Ref. 7, Appendix B. A 
related additional example follows. 

Let X be the space of real numbers, with the absolute value norm, 
and observe that the corresponding ℬ is the usual space of complex 
numbers. Take X_0 = X, let f(x) = x + x^3 for all real x, and take A and 
A_0 to be {x: |x| < r} and f^{-1}(A), respectively, for some positive r. 
Notice that any r > 0 will do, and that our f* is given by f*(z) = 
z + z^3 for all complex z. 

An easy contraction-mapping argument* shows that given ρ ∈ 
(0, (√3)^{-1}] and any complex number a with |a| < (ρ - ρ^3), there is a 
unique complex number z with |z| < ρ such that z + z^3 = a, that z is 
real whenever a is, and that the map from a to z is locally Lipschitz. 
It follows from Theorem 2 that we have g ∈ 𝒫(A, X_0) for r = r_0, where 
r_0 = 2(3√3)^{-1}. 
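
The contraction-mapping argument can be tried numerically. The sketch below is an illustration under stated assumptions, not the paper's proof: it iterates z <- a - z^3, which maps the disk |z| <= ρ into itself when |a| < ρ - ρ^3 and contracts distances by at most 3ρ^2. To get geometric convergence it restricts ρ to the open interval (0, 1/√3), slightly narrower than the range in the text, and the function name and fixed iteration count are hypothetical choices.

```python
import math


def solve_cubic_fixed_point(a: complex, rho: float, iterations: int = 200) -> complex:
    """Solve z + z**3 = a in the disk |z| < rho by iterating z <- a - z**3."""
    if not 0.0 < rho < 1.0 / math.sqrt(3.0):
        raise ValueError("rho must lie in (0, 1/sqrt(3)) for a strict contraction")
    if abs(a) >= rho - rho**3:
        raise ValueError("|a| must be smaller than rho - rho**3")
    z = 0j
    for _ in range(iterations):
        # The map T(z) = a - z**3 has Lipschitz constant at most 3 * rho**2 < 1.
        z = a - z * z * z
    return z


if __name__ == "__main__":
    root = solve_cubic_fixed_point(0.2 + 0.1j, rho=0.5)
    print(root, abs(root + root**3 - (0.2 + 0.1j)))
```

With a real right-hand side the iterates stay real, which mirrors the remark that z is real whenever a is. For ρ close to 1/√3 the contraction constant approaches 1 and more iterations would be needed.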

Theorem 2 also can be used to show that g does not belong to 
𝒫(A, X_0) if r > r_0: Suppose, for the purpose of obtaining a contradiction, 
that g ∈ 𝒫(A, X_0) for some r > r_0. Then for some V and V_0 as 
described in the theorem, f* restricted to V_0 is a homeomorphism of 
V_0 onto V. By the proof of Theorem 2, the inverse g* of f* is 
differentiable. Using g*[f*(z)] = z for z ∈ V_0, we have g*’[f*(z)]f*’(z) = 1 
(z ∈ V_0), where “’” denotes the ordinary derivative. Since f*’(z) = 0 
at z = z_0 ≜ i(√3)^{-1}, V_0 cannot contain z_0. Using this fact and the 
continuity of g*, it is not difficult to show that V cannot contain the 
point f*(z_0) = 2i(3√3)^{-1}. Since V is a star and A ⊂ V, 2(3√3)^{-1} ∉ A, 
which is the contradiction sought. This finishes the discussion of the 
example. 

* A good general source of information on the use of the contraction-mapping theorem 
is Ref. 16. 
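
The threshold r_0 can also be seen in the real power series of g itself. The sketch below is an illustration, not taken from the paper: it assumes the closed form (-1)^k C(3k, k)/(2k + 1) for the coefficient of x^(2k+1), which follows from Lagrange inversion applied to x + x^3, and generates terms from the ratio of consecutive coefficients. That ratio tends to 27/4, so the radius of convergence is (4/27)^(1/2) = 2(3√3)^{-1}, matching r_0. The function and constant names are hypothetical.

```python
import math


def inverse_series_partial_sum(x: float, terms: int = 200) -> float:
    """Partial sum of the Maclaurin series of g, the inverse of f(x) = x + x**3.

    The series is sum_k (-1)**k * C(3k, k) / (2k + 1) * x**(2k + 1); each
    term is obtained from the previous one via the coefficient ratio.
    """
    total = 0.0
    term = x  # k = 0 term
    for k in range(terms):
        total += term
        # |c_{k+1} / c_k| for c_k = C(3k, k) / (2k + 1)
        ratio = 3.0 * (3 * k + 1) * (3 * k + 2) / ((2 * k + 2) * (2 * k + 3))
        term *= -ratio * x * x
    return total


R0 = 2.0 / (3.0 * math.sqrt(3.0))  # r_0 of the example, about 0.3849

if __name__ == "__main__":
    g = inverse_series_partial_sum(0.3)  # 0.3 < R0: the series converges
    print(g, g + g**3)
    print(inverse_series_partial_sum(0.45))  # 0.45 > R0: partial sums blow up
```

Inside the radius the partial sum satisfies g + g^3 = x to rounding error; outside it the terms grow geometrically, in line with the conclusion that g is not expandable on A when r > r_0.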

Using Theorem 1, it follows at once that Theorem 2 remains true if 
the word “star” is replaced by “c-convex star.” 

The hypothesis concerning f* at the beginning of Section 2.2 is 
equivalent to the condition that f ∈ 𝒫_r(X_0, X); see the proof of 
Lemma 1 and Ref. 13 (top of p. 75). 


2.4 Convergence and the extent of invertibility 


In this section we prove a result that shows, in particular, that if g 
is expandable on A, and if its expansion converges on a larger open 
set B, then there is a set B_0 that contains the points of A_0 such that f 
is in fact an invertible map of B_0 onto B. 

Theorem 3: Let g belong to 𝒫(A, X_0), and let it have the generalized 
power-series representation 

    g(x) = \sum_{m=0}^{\infty} g_m(x)                                   (8) 

for x ∈ A. Suppose that the right side of (8) converges in X_0 for x ∈ B, 
where B is an open subset of X such that A ⊂ B. Then there is an open 
subset B_0 of X_0 such that (i) A_0 ⊂ B_0, and f is a homeomorphism of B_0 
onto B; and (ii) the inverse G of the restriction of f to B_0 has the 
representation 

    G(x) = \sum_{m=0}^{\infty} g_m(x),   x \in B. 

Proof: Since g ∈ 𝒫(A, X_0) and g is continuous, g ∈ 𝒫_r(A, X_0) (see 
the proof of Lemma 1). Thus, by Ref. 13, Theorem 6.2, the function 
h: B → X_0 defined by 

    h(x) = \sum_{m=0}^{\infty} g_m(x),   x \in B 

is analytic in the sense of Ref. 13 and hence continuous. Using Lemma 
1, there is an open connected set W ⊂ ℬ and a Fréchet-differentiable 
map h*: W → ℬ_0 such that B ⊂ W and h(x) = h*(x + i0) for x ∈ B. 
By Lemma 2 and the hypothesis that f[g(x)] = x (x ∈ A), we find 
that 

    f*[h*(v)] = v,   v ∈ W.                                          (9) 

Again using Lemma 2, and proceeding as in the proof of Theorem 1, 
one finds that 

    h*[f*(v)] = v,   v ∈ W_0,                                        (10) 

where W_0 is the component of f*^{-1}(W) that contains h*(W). Therefore, 
f* is a homeomorphism of W_0 onto W. 

We have, from (9), f[h(x)] = x (x ∈ B). Now let B_0 = f^{-1}(B) ∩ R_0, 
where R_0 = {x ∈ X_0: x + i0 = z, z ∈ W_0}. Since W_0 is open, R_0 is open 
in X_0. Thus, B_0 is an open subset of X_0, and from (10) one has h[f(x)] 
= x for x ∈ B_0. This shows that f is a homeomorphism of B_0 onto B, 
with h the inverse of the restriction of f to B_0. Finally, using A_0 ⊂ 
f^{-1}(A) ⊂ f^{-1}(B), and A_0 = h(A) ⊂ h(B) ⊂ h*(W) ⊂ W_0 (which implies 
that A_0 ⊂ R_0), as well as the definition of B_0, it is clear that A_0 ⊂ B_0. 
This proves the theorem. 


2.5 Results for implicit functions 


Theorems along the lines of Theorems 2 and 3 are given here for 
maps that are defined implicitly in the sense of the implicit function 
theorem. In this section, X_1 stands for a third real Banach space, ℬ_1 
denotes its complex extension in the sense of Section 2.1, and X_0 × X 
and ℬ_0 × ℬ are product Banach spaces constructed from X_0 and 
X and ℬ_0 and ℬ, respectively.* We say that a map h defined on an 
open subset D of ℬ into ℬ_0 is Gateaux differentiable on D (see Ref. 
12, pp. 109-10) if for each v ∈ D and arbitrary w ∈ ℬ the limit 
lim_{z→0} z^{-1}[h(v + zw) - h(v)] exists, in which z is a complex scalar. 

As in Section 2.2, A is an open subset of X, with 0 € A. Here Fis a 
map from Xo X X into X, such that there is a continuous map G:A > 
Xo with the property that 


F[G(x), x] =0, x€A. 
Assume that there is an F* € Ar(Bo X B, F,) such that F(y, x) = 
F*(y + 10, x + 10) for (y, x) © Xo X X. 
Theorem 4: $G \in \mathcal{P}(A, X_0)$ if and only if there is an open star
$V \subset \mathcal{X}$, with $A \subset V$, and a continuous
Gateaux-differentiable map $G^*: V \to \mathcal{X}_0$ such that we have
$G(x) = G^*(x + i0)$ ($x \in A$) as well as

\[ F^*[G^*(v), v] = 0, \quad v \in V. \tag{11} \]


Proof: First suppose that $G \in \mathcal{P}(A, X_0)$. By Lemma 1 there is
an open star $V \subset \mathcal{X}$ and a map
$G^* \in \mathcal{P}_F(V, \mathcal{X}_0)$ such that $A \subset V$ and
$G(x) = G^*(x + i0)$ for $x$ in $A$. Since
$G^* \in \mathcal{P}_F(V, \mathcal{X}_0)$, $G^*$ is F-differentiable
on $V$ and thus continuous and G-differentiable in $V$.

* Except where indicated to the contrary, the choice of the norms in
$X_0 \times X$ and $\mathcal{X}_0 \times \mathcal{X}$ is not important for
our purposes. It would suffice to let the norm $\|\cdot\|$ in $X_0 \times X$
be given by $\|(x_0, x)\| = \max(\|x_0\|, \|x\|)$, and similarly for
$\mathcal{X}_0 \times \mathcal{X}$.

It follows from a version of the chain rule (Ref. 15, pp. 171-2) that
$h$ defined by

\[ h(v) = F^*[G^*(v), v], \quad v \in V \]

is F-differentiable on $V$. [Notice that $h(v) = (F^*Q)(v)$, where $Q$ takes
$v \in V$ into the point $(G^*(v), v)$ in $\mathcal{X}_0 \times \mathcal{X}$.]
Since $V$ is connected, and
$F^*[G^*(\theta + x + i0), x + i0] = F[G(x), x] = 0$ ($\theta$ is the zero
element of $\mathcal{X}$) for $x$ in some open ball in $X$ centered at the
origin, by Lemma 2, one has (11).

Assuming, on the other hand, that we have a $V$ and a $G^*$ as indicated
in the theorem, $G^*$ is F-differentiable (see Ref. 12, Theorem 3.17.1)
and an obvious modification of the part of the proof of Theorem 2
that concerns $H$ shows that $G \in \mathcal{P}(A, X_0)$. This proves the
theorem.


Theorem 5: Assume that $G \in \mathcal{P}(A, X_0)$, and that $G$ has the
generalized power-series representation

\[ G(x) = \sum_{m=0}^{\infty} G_m(x) \tag{12} \]

for $x \in A$. Suppose that the right side of (12) converges for $x \in B$,
where $B$ is an open subset of $X$ such that $A \subset B$. Then
$F[G(x), x] = 0$ ($x \in B$), where $G$ is defined for all $x \in B$ by (12).

Proof: Paralleling the beginning of the proof of Theorem 3, there is
an open connected set $W \subset \mathcal{X}$ and a Fréchet-differentiable
$h^*: W \to \mathcal{X}_0$ such that $B \subset W$ and $G(x) = h^*(x + i0)$
($x \in B$). Using Lemma 2 and the observation in the proof of Theorem 4
concerning the applicability of a version of the chain rule, we have
$F^*[h^*(v), v] = 0$ for $v \in W$, which implies that $F[G(x), x] = 0$
($x \in B$).

Remarks: Theorem 4 bears directly on problems concerning the exist- 
ence of generalized power-series expansions for the solutions of non- 
linear differential equations, because, as is well known, these equations 
can frequently be put in the form $F[G(x), x] = 0$, $x \in A$, where $x$ takes
into account inputs and/or initial conditions, and G(x) is the corre- 
sponding solution. For related earlier work, see Ref. 8 and the refer- 
ences cited therein; the work includes, in particular, a description of 
the specific type of expansions that arise. 

A result similar to Theorem 2 in Section 2.3 can be obtained for
equations of the form $H(y, x) = w$, where $y$ is a solution that depends
on both $x$ and $w$.* Specifically, suppose that $H$ is a map from
$X_0 \times X$ into $X_1$ such that $H(y, x) = H^*(y + i0, x + i0)$ for all
$x$ and $y$ for some
$H^* \in \mathcal{P}_F(\mathcal{X}_0 \times \mathcal{X}, \mathcal{X}_1)$.
Assume now that the norm $\|\cdot\|$ in $X_0 \times X$ is defined by
$\|(x_0, x)\| = \max(\|x_0\|, \|x\|)$, and similarly for $(X_1 \times X)$,
$(\mathcal{X}_0 \times \mathcal{X})$, and $(\mathcal{X}_1 \times \mathcal{X})$.
Assume also that $\mathcal{X}_0$ and $\mathcal{X}_1$ are homeomorphic in the
sense of (ii) of Theorem 2.

* See Ref. 8, p. 75, for an example of how such an equation arises.

Let $A_{wx}$ and $S$, respectively, be open subsets of $(X_1 \times X)$ and
$(X_0 \times X)$, with $(0, 0) \in A_{wx}$, such that for each
$(w, x) \in A_{wx}$ there is a unique $y \in X_0$ for which $(y, x) \in S$
and $H(y, x) = w$. Assume that the map from $(w, x)$ to $y$ is continuous.
Define $f: (X_0 \times X) \to (X_1 \times X)$ by $f(y, x) = [H(y, x), x]$ for
all $y$ and $x$, and let $A_{yx}$ denote the open set
$S \cap f^{-1}(A_{wx})$. Notice that $f$ restricted to $A_{yx}$ is a
homeomorphism of $A_{yx}$ onto $A_{wx}$. Thus, using Theorem 2 and the
observation in the footnote in Appendix B, we see that the map from $(w, x)$
to $y$ described above belongs to $\mathcal{P}(A_{wx}, X_0)$ if and only if
$(\mathcal{X}_1 \times \mathcal{X})$ and $(\mathcal{X}_0 \times \mathcal{X})$,
respectively, contain open subsets $V$ and $V_0$ with $A_{wx} \subset V$,
$A_{yx} \subset V_0$, $V$ a star, and $[H^*(y^*, x^*), x^*] \in V$ for
$(y^*, x^*) \in V_0$, such that for each $(w^*, x^*) \in V$, there is a unique
$y^* \in \mathcal{X}_0$ that satisfies $(y^*, x^*) \in V_0$ and
$H^*(y^*, x^*) = w^*$, with the map from $(w^*, x^*)$ to $y^*$ locally
Lipschitz.


2.6 Construction of the series for g of Section 2.2 


Here we return to the setting introduced at the beginning of Section 
2.2. Theorem 6 (in this section) provides an algorithm for determining 
the expansion of g whenever it exists. The theorem is a version of a 
result in Ref. 1 concerning complex spaces. We shall first prove a 
proposition that establishes the existence of certain derivatives that 
play a central role in the theorem. 

Proposition: For each $m = 1, 2, \ldots$ the $m$th order Fréchet derivative
(Ref. 14, pp. 179-81) $d^m f[g(0)]$ [of $f$ at the point $g(0) \in X_0$]
exists, and one has

\[ f[g(0) + x] = f[g(0)] + \sum_{m=1}^{\infty} (m!)^{-1} d^m f[g(0)]\, x^m, \quad x \in X_0. \tag{13} \]

Proof: Using the hypothesis that
$f^* \in \mathcal{P}_F(\mathcal{X}_0, \mathcal{X})$, the Fréchet derivative
$df^*(w)$ and hence the derivatives $d^m f^*(w)$ ($m = 2, 3, \ldots$) exist
for $w \in \mathcal{X}_0$, and we have

\[ f^*[g^*(0) + w] = f^*[g^*(0)] + \sum_{m=1}^{\infty} (m!)^{-1} d^m f^*[g^*(0)]\, w^m, \quad w \in \mathcal{X}_0 \tag{14} \]

(see Ref. 12, Theorems 26.6.3 and 3.16.2; Ref. 15, Lemma 3.6.1; Ref.
7, Lemma 2), where $g^*(0) = g(0) + i0$. Since $df^*$ exists on
$\mathcal{X}_0$, and the norm in $\mathcal{X}$ has the property that
$\|x_1 + ix_2\| < \delta$ implies that $\|x_1\| < \delta$ and
$\|x_2\| < \delta$ [see (7)], it is easy to see that $df$ exists on $X_0$
and that one has $df(a)x = df^*(a + i0)(x + i0)$ for $a$ and $x$ in $X_0$.
A simple inductive argument shows that $d^m f$ exists on $X_0$, with

\[ d^m f(a)\, x_1 \cdots x_m = d^m f^*(a + i0)(x_1 + i0) \cdots (x_m + i0) \tag{15} \]

for $a \in X_0$, $(x_1, \ldots, x_m) \in X_0^m$, and each $m$. This proves
the proposition, since it is clear that $f^*[g^*(0)] = f[g(0)]$. The
relation (13) directs attention to an interpretation of the $d^m f[g(0)]$;
it is not used otherwise.


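Theorem 6: Let $g \in \mathcal{P}(A, X_0)$. Then $df[g(0)]$ is a
homeomorphism of $X_0$ onto $X$, and

\[ g(x) = g(0) + \sum_{m=1}^{\infty} g_m(x), \quad x \in A, \]

where the $g_m$ are the homogeneous polynomials defined by

\[ g_1(x) = df[g(0)]^{-1} x \]

and

\[ g_m(x) = -df[g(0)]^{-1} \sum_{\ell=2}^{m} (\ell!)^{-1} \sum_{k_1 + \cdots + k_\ell = m} d^\ell f[g(0)]\, g_{k_1}(x) \cdots g_{k_\ell}(x), \quad m \ge 2. \]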
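As an illustration only, the recursion of Theorem 6 can be sketched in the scalar special case, under the assumption that $X_0 = X$ is the real line, so that $d^\ell f[g(0)]$ acts as multiplication by the ordinary $\ell$th derivative and $g_m(x) = c_m x^m$. The function names and the list layout `derivs` are hypothetical and are not part of the paper.

```python
from fractions import Fraction
from math import factorial


def composition_sum(c, parts, m):
    """Sum of c[k_1]*...*c[k_parts] over all k_1+...+k_parts = m with k_j >= 1."""
    # row[j] holds the sum for total j using the number of parts processed so far.
    row = [Fraction(1)] + [Fraction(0)] * m  # zero parts: only the empty total 0
    for _ in range(parts):
        new = [Fraction(0)] * (m + 1)
        for j in range(1, m + 1):
            new[j] = sum(c[k] * row[j - k] for k in range(1, j + 1))
        row = new
    return row[m]


def inverse_series_coeffs(derivs, order):
    """Scalar sketch of Theorem 6.

    derivs[l] is the l-th derivative of f at g(0) (index 0 is unused);
    derivatives beyond the end of the list are taken to be zero.
    Returns [c_1, ..., c_order] with g(x) = g(0) + sum_m c_m x^m.
    """
    c = [Fraction(0)] * (order + 1)
    c[1] = 1 / Fraction(derivs[1])              # g_1(x) = df[g(0)]^{-1} x
    for m in range(2, order + 1):
        total = Fraction(0)
        for l in range(2, m + 1):
            d_l = derivs[l] if l < len(derivs) else 0
            if d_l:
                total += Fraction(d_l) / factorial(l) * composition_sum(c, l, m)
        c[m] = -c[1] * total                    # the m >= 2 recursion
    return c[1:]


# f(y) = y + y^2, so f'(0) = 1, f''(0) = 2 and all higher derivatives vanish.
coeffs = inverse_series_coeffs([0, 1, 2], 5)
```

For this choice of f the inverse is g(x) = (-1 + sqrt(1 + 4x))/2, whose expansion x - x^2 + 2x^3 - 5x^4 + 14x^5 - ... agrees with the coefficients the recursion produces.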


2.6.1 Proof of Theorem 6 


The inverse of $df[g(0)]$ exists because $f$ is a homeomorphism of $A_0$
onto $A$ with $f$ and $g$, respectively, Fréchet differentiable on $A_0$ and
$A$ (Ref. 15, p. 175, Problem 6). Let

\[ g(x) = g(0) + \sum_{m=1}^{\infty} g_m(x), \quad x \in A, \]

in which each $g_m$ is a homogeneous polynomial on $X$ of degree $m$.
With $g^*$ and $V$ associates of $g$ and $A$, respectively, in accordance
with Theorem 1, we have $g^* \in \mathcal{P}_F(V, \mathcal{X}_0)$ and
$v = f^*[g^*(v)]$ for $v \in V$. Let $g_1^*, g_2^*, \ldots$ be continuous
homogeneous polynomials such that

\[ g^*(v) = g^*(0) + \sum_{m=1}^{\infty} g_m^*(v) \tag{16} \]

for $v \in V$. By the part of the proof of Theorem 2 concerning $H$,
$g_m(x) = g_m^*(x + i0)$ for each $m$ and each $x \in X$. Using (14) and
$f^*[g^*(0)] = 0$,

\[ v = f^*[g^*(v)] = \sum_{\ell=1}^{\infty} (\ell!)^{-1} d^\ell f^*[g^*(0)] \Bigl( \sum_{k_1=1}^{\infty} g_{k_1}^*(v) \Bigr) \cdots \Bigl( \sum_{k_\ell=1}^{\infty} g_{k_\ell}^*(v) \Bigr), \quad v \in V. \]


Since $g^* \in \mathcal{P}_F(V, \mathcal{X}_0)$, there is a $\sigma > 0$ such
that the right side of (16) is absolutely convergent for $\|v\| < \sigma$
(Ref. 12, Theorem 26.6.6). Thus, using the boundedness of the
$d^\ell f^*[g^*(0)]$ and an easily proved generalization of Theorem 5.5.3 of
Ref. 14,

\[ v = \sum_{\ell=1}^{\infty} (\ell!)^{-1} \sum_{k_1, \ldots, k_\ell = 1}^{\infty} d^\ell f^*[g^*(0)]\bigl(g_{k_1}^*(v) \cdots g_{k_\ell}^*(v)\bigr) \tag{17} \]

for $\|v\| < \sigma$, in which the sum over $(k_1, \ldots, k_\ell)$ is
absolutely convergent.

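At this point we use the proposition that there are positive constants
$M$, $\beta$, $K$, and $\alpha$ such that

\[ \| d^\ell f^*[g^*(0)]\, w_1 \cdots w_\ell \| \le \ell^\ell M \beta^\ell \|w_1\| \cdots \|w_\ell\| \]

and

\[ \| g_k^*(v) \| \le K (\alpha \|v\|)^k \]

for $\ell \ge 1$, $k \ge 1$, $v \in \mathcal{X}$, and $w_1, \ldots, w_\ell$
in $\mathcal{X}_0$ (see Ref. 7, Appendix A). It is clear that

\[ \| d^\ell f^*[g^*(0)][g_{k_1}^*(v) \cdots g_{k_\ell}^*(v)] \| \le \ell^\ell M (\beta K)^\ell (\alpha \|v\|)^{k_1 + \cdots + k_\ell}. \]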
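Consider the sum

\[ \sum_{\ell \ge 1} \; \sum_{k_1, \ldots, k_\ell \ge 1} (\ell!)^{-1} \ell^\ell M (\beta K)^\ell (\alpha \|v\|)^{k_1 + \cdots + k_\ell}. \tag{18} \]

Notice that

\[ \sum_{k_1, \ldots, k_\ell = 1}^{\infty} (\alpha \|v\|)^{k_1 + \cdots + k_\ell} = \bigl[ \alpha \|v\| (1 - \alpha \|v\|)^{-1} \bigr]^\ell \tag{19} \]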


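for $\alpha \|v\| < 1$. Using (19) and Stirling's formula for $n!$, which
gives $n! > (2\pi)^{1/2} n^{1/2} n^n e^{-n}$, it easily follows that the sum
(18) converges for $\|v\|$ sufficiently small. Thus (see Ref. 14, Theorem
5.3.4) for such $v$ the sum in (17) over $(\ell, k_1, \ldots, k_\ell)$, which
equals

\[ \sum_{\ell=1}^{\infty} \sum_{m=\ell}^{\infty} \; \sum_{k_1 + \cdots + k_\ell = m} (\ell!)^{-1} d^\ell f^*[g^*(0)]\bigl(g_{k_1}^*(v) \cdots g_{k_\ell}^*(v)\bigr), \tag{20} \]

can be written as (see Ref. 14, Theorem 5.3.6)

\[ \sum_{m=1}^{\infty} \sum_{\ell=1}^{m} \; \sum_{k_1 + \cdots + k_\ell = m} (\ell!)^{-1} d^\ell f^*[g^*(0)]\bigl(g_{k_1}^*(v) \cdots g_{k_\ell}^*(v)\bigr). \tag{21} \]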


By the uniqueness result for generalized power series mentioned in
Section 2.1, and the fact that (21) equals $v$ for $v$ of sufficiently small
norm,

\[ df^*[g^*(0)]\, g_1^*(v) = v \]

and

\[ df^*[g^*(0)]\, g_m^*(v) = -\sum_{\ell=2}^{m} (\ell!)^{-1} \sum_{k_1 + \cdots + k_\ell = m} d^\ell f^*[g^*(0)][g_{k_1}^*(v) \cdots g_{k_\ell}^*(v)] \quad (m \ge 2) \]

for $v \in \mathcal{X}$. This, with $v = x + i0$ and (15), completes the proof.


2.6.2 Comments 

For the case in which the expansion (14) for $f^*$ has only a finite
number of terms, the proof of Theorem 6 simplifies considerably,
because then the equivalence of (20) and (21) is a consequence of just
the absolute convergence of the sum over $(k_1, \ldots, k_\ell)$ in (17). A
related result for this case is given in Ref. 17, p. 29.

For a different approach to the problem of determining the expan- 
sion of g, see Ref. 18. 

The proof of Theorem 6 provides an alternative proof of Theorem 
2 of Ref. 1, which is an analogous result concerning only complex 
spaces. It also yields a proof of the following “substitution theorem” 
(an earlier version proved in a different way appears in Ref. 19).* 


Theorem 7: Take $W_1$, $W_2$, and $W_3$ to be three complex Banach spaces,
and let $S_1$ and $S_2$ be nonempty open subsets of $W_1$ and $W_2$,
respectively, with $S_1$ a star. Let $G \in \mathcal{P}_F(S_1, W_2)$ and let
$F$ be a Fréchet-differentiable map of $S_2$ into $W_3$. Assume that
$G(S_1) \subset S_2$. Then $(FG)(\cdot) \in \mathcal{P}_F(S_1, W_3)$, the
Fréchet derivatives $d^m G(0)$ and $d^m F[G(0)]$ exist for $m \ge 1$, and we
have

\[ (FG)(v) = F[G(0)] + \sum_{m=1}^{\infty} H_m(v), \quad v \in S_1, \tag{22} \]

where the $H_m$ are the homogeneous polynomials given by

\[ H_m(v) = \sum_{\ell=1}^{m} (\ell!)^{-1} \sum_{\substack{k_1 + \cdots + k_\ell = m \\ k_j > 0}} d^\ell F[G(0)]\, (k_1!)^{-1} d^{k_1} G(0)\, v^{k_1} \cdots (k_\ell!)^{-1} d^{k_\ell} G(0)\, v^{k_\ell}. \]


Theorem 7 and Lemma 1 can be used to obtain results along the 
lines of Theorem 7 for cases in which the spaces of interest are real. 
This is discussed briefly in Appendix A. 
APPENDIX A


Substitution Results for Real Spaces 


This appendix presents two useful corollaries of Theorem 7. Proofs 
are omitted because the corollaries can be proved using direct modifi- 
cations of material already discussed. 


Corollary 1: Assume that $W_1$, $W_2$, and $W_3$ are real Banach spaces, that
$S_1$ and $S_2$ are open subsets of $W_1$ and $W_2$, respectively, and that
$0 \in S_1$. Let $G$ be a map of $S_1$ into $W_2$ such that the Fréchet
derivative $d^m G(0)$ exists for $m = 1, 2, \ldots$, and $G$ has the
representation

\[ G(x) = G(0) + \sum_{m=1}^{\infty} (m!)^{-1} d^m G(0)\, x^m, \quad x \in S_1. \]

Suppose that $G(S_1) \subset S_2$. Let $F$ map $S_2$ into $W_3$ such that
$d^m F[G(0)]$ exists for each $m > 0$, and

\[ F[y + G(0)] = F[G(0)] + \sum_{m=1}^{M} (m!)^{-1} d^m F[G(0)]\, y^m \]

for $y + G(0) \in S_2$, where $M$ is a positive integer (and thus $F$ is
assumed to be a polynomial). Then $(FG)(\cdot) \in \mathcal{P}(S_1, W_3)$ and
(22) holds.

Corollary 2: Suppose that $W_1$, $W_2$, and $W_3$ are three real Banach
spaces. Let $G \in \mathcal{P}_F(S_1, W_2)$ for some open subset $S_1$ of
$W_1$ containing the point 0, and let $F \in \mathcal{P}_F(S_2, W_3)$, where
$S_2$ is an open subset of $W_2$ containing $G(0)$. Then the Fréchet
derivatives $d^m G(0)$ and $d^m F[G(0)]$ exist for $m \ge 1$, and there is an
open subset $T_1$ of $S_1$, containing 0, such that $G(T_1) \subset S_2$,
$(FG)(\cdot) \in \mathcal{P}_F(T_1, W_3)$, and one has (22), with $S_1$
replaced with $T_1$.


APPENDIX B 


A Comparison of Norms on Complex Spaces 


Consider the Banach space $\mathcal{X}$ described in Section 2.1, and let
$\mathcal{Y}$ denote a Banach space consisting of the same set of points with
a possibly different norm $\|\cdot\|_{\mathcal{Y}}$.

Proposition: Let the norm $\|\cdot\|_{\mathcal{Y}}$ have the property that
$\|x_1\| \le \|x_1 + ix_2\|_{\mathcal{Y}}$ for $(x_1 + ix_2) \in \mathcal{Y}$,
in which $\|x_1\|$ is the $X$ norm of $x_1$. Then
$\|x_1 + ix_2\| \le \|x_1 + ix_2\|_{\mathcal{Y}}$ for
$(x_1 + ix_2) \in \mathcal{X}$.

Proof: Assume, for the purpose of obtaining a contradiction, that
$\|x_1 + ix_2\| > \|x_1 + ix_2\|_{\mathcal{Y}}$ for some $(x_1 + ix_2)$. Then
there is a $\xi$ with $\|\xi\| = 1$ such that

\[ \xi(x_1)^2 + \xi(x_2)^2 > \|x_1 + ix_2\|_{\mathcal{Y}}^2. \]

For this $\xi$, choose real $\alpha$ and $\beta$ so that not both are zero and

\[ \alpha \xi(x_2) + \beta \xi(x_1) = 0. \]

Using
$(\alpha^2 + \beta^2)[\xi(x_1)^2 + \xi(x_2)^2] > (\alpha^2 + \beta^2)\|x_1 + ix_2\|_{\mathcal{Y}}^2$
and the observation that
$[\xi(ax - by)]^2 + [\xi(bx + ay)]^2 = (a^2 + b^2)[\xi(x)^2 + \xi(y)^2]$
for real $a$ and $b$, and $x$ and $y$ in $X$, one has

\[ |\xi(\alpha x_1 - \beta x_2)| > \|(\alpha x_1 - \beta x_2) + i(\beta x_1 + \alpha x_2)\|_{\mathcal{Y}}. \]

Since the left side is at most $\|\alpha x_1 - \beta x_2\|$, we have a
contradiction.


Comments: For $X$ the space of bounded $n$-vector-valued functions
described in Section 2.1, and $\mathcal{Y}$ the corresponding complex Banach
space with $\|v\|_{\mathcal{Y}} = \max_j \sup_t |v_j(t)|$, the equality
$\|\cdot\| = \|\cdot\|_{\mathcal{Y}}$ holds, where $\|\cdot\|$ is the norm in
$\mathcal{X}$. Indeed, $\|\cdot\| \le \|\cdot\|_{\mathcal{Y}}$ by the
proposition, while with arbitrary $t \ge 0$ and $j \in \{1, \ldots, n\}$,

\[ \xi(x_1)^2 + \xi(x_2)^2 = [x_{1j}(t)]^2 + [x_{2j}(t)]^2 \]

for $\xi$ the linear functional of unit norm on $X$ defined by
$\xi(x) = x_j(t)$, showing that $\|\cdot\| \ge \|\cdot\|_{\mathcal{Y}}$.*

However, we have $\|\cdot\| \le \|\cdot\|_{\mathcal{Y}}$ but not
$\|\cdot\| = \|\cdot\|_{\mathcal{Y}}$ whenever $X$ is a Hilbert space,
$\|x_1 + ix_2\|_{\mathcal{Y}} = (\|x_1\|^2 + \|x_2\|^2)^{1/2}$, and $X$ is
typical in the sense that it has a pair of nonzero elements $x$ and $y$ that
are orthogonal. This follows from the inner-product representation of
linear functionals in a Hilbert space, and the fact that for $X$, $x$, and
$y$, as indicated above, one can show that

\[ \sup_{\|v\| = 1} \frac{(v, x)^2 + (v, y)^2}{\|x\|^2 + \|y\|^2} < 1, \]

where $(\cdot, \cdot)$ is the inner product in $X$.

Finally, we mention that the norm in $\mathcal{X}$ cannot be replaced with

\[ \|(x_1, x_2)\| = (\|x_1\|^2 + \|x_2\|^2)^{1/2}, \tag{23} \]

because (23) does not define a norm in $\mathcal{X}$ unless $X$ is a Hilbert
space (see Ref. 20). It is not difficult to see that (23) does not suffice:
If it did, we would have
$\|ax - by\|^2 + \|bx + ay\|^2 = (a^2 + b^2)(\|x\|^2 + \|y\|^2)$
for any real numbers $a$ and $b$, and arbitrary elements $x$ and $y$ of $X$.
This would give $\|x - y\|^2 + \|x + y\|^2 = 2(\|x\|^2 + \|y\|^2)$, which is
not valid unless $X$ is a Hilbert space (see Ref. 21, p. 211).
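The failure of the parallelogram identity outside Hilbert spaces can be checked numerically. The sketch below is an illustration under assumed choices that are not in the paper: the plane with the maximum norm stands in for a non-Hilbert $X$, the Euclidean norm for a Hilbert one, and the helper names are hypothetical.

```python
def sup_norm(v):
    """Maximum norm on R^n (not induced by any inner product for n >= 2)."""
    return max(abs(c) for c in v)


def euclid_norm(v):
    """Euclidean norm on R^n (a Hilbert-space norm)."""
    return sum(c * c for c in v) ** 0.5


def parallelogram_gap(norm, x, y):
    """||x - y||^2 + ||x + y||^2 - 2(||x||^2 + ||y||^2); zero in a Hilbert space."""
    diff = [a - b for a, b in zip(x, y)]
    summ = [a + b for a, b in zip(x, y)]
    return norm(diff) ** 2 + norm(summ) ** 2 - 2 * (norm(x) ** 2 + norm(y) ** 2)


x, y = (1.0, 0.0), (0.0, 1.0)
gap_sup = parallelogram_gap(sup_norm, x, y)        # nonzero: identity fails
gap_euclid = parallelogram_gap(euclid_norm, x, y)  # zero up to rounding
```

With these two vectors the maximum norm gives 1 + 1 on the left and 4 on the right, so the identity fails, while the Euclidean norm satisfies it.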
The throughput per user in loop and bus configured local area networks
decreases linearly with the number of users. These networks cannot be 
extended to a metropolitan area with many users. A class of mesh networks is 
described that increases the throughput of conventional local area networks 
by decreasing the fraction of the network capacity needed to transmit infor- 
mation between a source and a destination. These networks have multiple 
paths that increase the reliability of the networks, and have point-to-point 
links that can cover a metropolitan area. In general, mesh networks require 
complex store-and-forward nodes that also route messages, control the flow of 
data entering the network, resequence packets at the destination, and recover 
packets with errors. However, there are characteristics of the local or metro- 
politan area that allow these functions to be simplified. As a result of these 
simplifications, loop access protocols are extended to mesh networks and the 
need to store and forward data is eliminated. A file transfer protocol that does 
not require packet resequencing is described. Three mesh networks are studied, 
and the desirable characteristics of networks are determined. One network, 
the Manhattan street network, has many of the desirable characteristics. 


I. INTRODUCTION 


Loop topologies¹ and random access strategies² were first applied to
local data networks in the late 1960's. In that era,

• Low-bit-rate terminals were connected to large central computers,

• Computers and terminals were shared by a few computer experts,

• Large-scale integrated circuits did not exist, and

• High-bit-rate transmission facilities were not readily available
across public rights of way.

As a result, these networks trade reliability, total throughput, and
the distance the network can span³ for simple access and transmission
strategies. Today, for comparison,

• Simple terminals are evolving into personal computers with bit-mapped,
rather than character, displays,

• Computer usage is becoming universal,

• Very-Large-Scale Integration (VLSI) is becoming commonplace, and

• The increased deployment of optical fibers and CATV systems
makes it possible to obtain high-bit-rate communications over
wider areas.

Personal computers use larger bandwidths than simple terminals to 
communicate with centralized support facilities and distribute proc- 
essing. The increasing use of these devices and the increased distances 
that high-bit-rate networks can span increase the throughput required 
of the interconnecting network. In loop and bus systems the total 
throughput is constant. The average capacity available to each user 
decreases linearly as the number of users increases. Therefore, to 
support more users with greater individual requirements, alternative 
topologies must be considered. The complexity of the devices being 
connected to networks and advances in VLSI make more complex 
network interfaces feasible. This increases the class of networks and 
access strategies that can be considered. 

A large number of users, dispersed over a large area, can be accom- 
modated by interconnecting conventional local area networks with 
gateways. Schlatter and Massey have analyzed this type of network.⁴
Their system consists of loops interconnected by switching elements,
as proposed by Pierce.⁵ Messages use a smaller fraction of the total
network capacity than they would if the system were a single loop. 
Therefore, the maximum throughput of the system increases. Users 
who communicate the most often are placed on the same loop, which 
minimizes the interference between subgroups of users. The main 
disadvantage with this approach is that the gateways are different 
from the access units and are complex store-and-forward elements. 

Yemini⁶ and Saadawi and Schwartz⁷ are investigating a tree topology.
In this network, users are at the leaves of the tree and the nodes
of the tree are switching points. Depending on the location of the
destination, the switches direct messages toward the root of the tree
or toward the leaves. In Yemini's system, the switches establish
separate broadcast networks, and in Saadawi's, the switches store
packets until the desired path is available. In these systems, messages
only use a portion of the network capacity. Locating users who
communicate frequently near one another in the tree hierarchy
minimizes the interference between subgroups of users. The
advantage of this approach over gateways is that there is only one
type of element in the network. The disadvantage is that the network
either stores and forwards packets or retains the distance constraints
of broadcast networks.

To a certain extent these alternatives remove the throughput con- 
straints of loop and bus systems. However, they still have single points 
of failure. To make networks more reliable, there must be multiple 
paths between each source and destination. By adding paths appro- 
priately, the average and maximum distance between nodes decreases, 
messages use a smaller fraction of the network bandwidth, and the 
throughput increases. Multiple paths also make it possible to avoid 
heavily used segments of the network to equalize the load. Mesh 
networks, like loop networks, have point-to-point communication 
channels between nodes. This results in less expensive line drivers 
and receivers than multidrop broadcast systems and is compatible 
with current optical fiber transmission capabilities. 

In general, mesh-configured networks and some of the local network 
alternatives require complex store-and-forward nodes. A queue of 
messages is maintained because packets arriving on several of the 
incoming links may be destined for the same outgoing link. In addition, 
store-and-forward networks must do routing, flow control, packet 
resequencing, and error control. Long-distance networks, such as the
ARPA network,⁸ perform these functions, but their node interfaces are
more complex than personal computers. Therefore, these networks
are not a reasonable interconnection alternative for personal computers.
There are, however, characteristics of the local or metropolitan
area environment that make simpler mesh networks possible.

Local or metropolitan area networks differ from general long-distance
networks in that the

• Physical location of the nodes does not dictate the topology of the
network to as great an extent,

• Error rates are much lower, and

• Communications lines are less expensive.

In the local environment, it is not always necessary to connect the
closest nodes together. Occasionally, connecting nodes that are farther
apart can make the topology of the network regular, and simplify tasks 
such as routing. The lower error rates make it more likely that a 
packet will traverse the entire network without error. Therefore, error 
control protocols can operate on an end-to-end, rather than on a link- 
by-link basis, and networks can have unidirectional links. When 
messages are not stored at the intermediate nodes for possible
retransmission, the class of possible access and transmission strategies
increases. In general, there is also a trade-off between the system
complexity and communications efficiency. The cost of communications
lines, and the additional access strategies and network options
that are possible, result in a different network solution in local
networks than in long-distance networks. The result is that local networks
are significantly less complex.

Section III describes several regular networks with simple routing
strategies. End-to-end error control protocols make it possible to
extend the slotted system and register-insertion techniques developed
for loop systems to mesh networks. This is shown in Section IV. With
these techniques, flow control on mesh networks can be done by
throttling the sources, as on loop networks. A trade-off exists between
the buffering in these systems and the efficiency with which the
communication lines are used. One attempt to take advantage of this
trade-off is the Floodnet system.⁹ Section IV also describes how this
trade-off is applied to mesh networks with slotted system and
register-insertion interfaces. The result is that for certain topologies
mesh networks without buffering are reasonable. Finally, Section V
describes several file-transfer protocols that do not resequence packets.


II. EXAMPLE


Before discussing the implementation of mesh networks, we show 
that these networks provide a potential to increase the throughput of 
conventional local area networks. In this example, two-connected 
networks, with as few as 64 nodes, increase the throughput of bus 
configured networks by a factor of 20 to 30. This comparison assumes 
that the same rate communication lines are used in both the mesh 
and random access networks. A factor of two increase in throughput 
is obtained because there is twice as much capacity emanating from 
each node. However, the major portion of the increase occurs because 
messages in the mesh network use only a fraction of the total network 
capacity. Greater increases are obtained in larger networks. 

Two traffic distributions are considered, a uniform distribution and 
a skewed distribution. In the uniform case, each node sends an equal 
amount of data to each of the other nodes. The skewed distribution 
corresponds to what might occur in a network of personal computers 
and file servers. The network is divided into communities of interest, 
each consisting of a file server and seven personal computers. A 
personal computer directs 80 percent of its traffic to its own file server 
and 20 percent to the other file servers. The computer receives an 
equal amount of traffic from the file servers. 
For each traffic distribution, the throughput for six network topologies
is investigated. The first two networks are the conventional
broadcast bus and loop configured networks. The throughput of the 
bus network is calculated assuming that the link utilization can 
approach one, and is an upper bound on the achievable throughput. 
In the loop network, the packets only use the links between the source 
and destination. In this network, and in the remaining networks, the 
throughput is determined by increasing the traffic levels from the 
sources until the utilization on any link equals one. The remaining 
four networks are two-connected networks with two links arriving at, 
and two links emanating from, each node. The first of these networks 
is a conventional bidirectional loop. For the skewed distribution, the 
file server is in the middle of the seven personal computers it is 
servicing. The next two networks are regular arrays called the modified 
shuffle exchange and the Manhattan street network. These networks 
are described in Section III. The Manhattan street networks with 16,
32, 48, and 64 nodes are 4 × 4, 8 × 4, 8 × 6, and 8 × 8 arrays,
respectively. For the skewed distribution, the seven personal computers
and the file server in a community of interest are arranged in
a 4 × 2 array on the network. This is shown for the 16-node network
in Fig. 1. The final network is a hierarchical shuffle exchange,
consisting of shuffle-exchange networks with the eight devices in a
community of interest, interconnected by a bidirectional loop. Figure 2
shows a 32-node, hierarchical network consisting of four 8-node
shuffle-exchange networks.

Fig. 1—A 16-node Manhattan street network with two communities of seven personal
computers (T) and a file server (FS).

Fig. 2—A 32-node hierarchical network with four communities of seven personal
computers (T) and a file server (FS) interconnected by shuffle-exchange networks. The
four shuffle-exchange networks are connected by a bidirectional loop.

Traffic on the two-connected networks is placed on the shortest 
path between the source and destination. If there are several paths of 
equal length between a source and a destination, the path with the 
smallest flow is selected. Traffic with the shortest distance between a 
source and destination is assigned to the network first. Once a
source-destination requirement is assigned a path, the path is not changed if
a link on the path becomes saturated, and the requirements are not
split if two equally good paths exist. This procedure does not lead to
the optimum throughput, but gives a reasonably good idea of what can
be achieved.

Table I—The average megabits per user in networks with 10-Mb/s
channels and the improvement over a broadcast network

             16 Nodes         32 Nodes         48 Nodes         64 Nodes
Network   Mb/Usr  Imprv.   Mb/Usr  Imprv.   Mb/Usr  Imprv.   Mb/Usr  Imprv.

Uniform Requirements

BDCST      0.63     —       0.31     —       0.21     —       0.16     —
Loop       1.25    2.00     0.63    2.00     0.42    2.00     0.31    2.00
BDL        4.69    7.50     2.42    7.75     1.63    7.83     1.23    7.87
S-X        5.77    9.23     4.03   12.88      —       —       3.09   19.76
MSN        6.52   10.43     4.56   14.59     4.31   20.70     3.94   25.20
HS-X       4.69    7.50     1.96    6.28     1.41    6.75     1.13    7.20

Skewed Requirements

BDCST      0.36     —       0.18     —       0.12     —       0.09     —
Loop       0.71    2.00     0.36    2.00     0.24    2.00     0.18    2.00
BDL        2.63    7.37     2.08   11.67     1.80   15.11     1.59   17.82
S-X        1.61    4.52     1.01    5.68      —       —       0.81    9.10
MSN        2.63    7.37     2.17   12.17     2.12   17.80     2.12   23.76
HS-X       2.78    7.78     2.78   15.56     2.66   22.34     2.63   29.47

BDCST = Broadcast, BDL = Bidirectional loop, S-X = Modified shuffle
exchange, MSN = Manhattan street network, HS-X = Hierarchical shuffle
exchange. The shuffle-exchange network is defined only when the number
of nodes is a power of two, so it has no 48-node entry.
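The path-assignment procedure described above can be sketched as follows. This is only an illustration under stated assumptions, not the author's program: traffic is uniform, "the path with the smallest flow" is read as the path whose most heavily loaded link carries the least flow, and the names `ring`, `shortest_paths`, and `per_user_throughput` are hypothetical.

```python
from collections import deque


def ring(n, bidirectional=False):
    """Successor lists for a loop of n nodes (assumed test topology)."""
    return {i: [(i + 1) % n] + ([(i - 1) % n] if bidirectional else [])
            for i in range(n)}


def bfs_dist(succ, s):
    """Hop distances from s to every reachable node."""
    dist, queue = {s: 0}, deque([s])
    while queue:
        u = queue.popleft()
        for v in succ[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def shortest_paths(succ, dist, s, d):
    """All shortest s->d paths, each given as a list of directed links."""
    if d == s:
        return [[]]
    paths = []
    for u in succ:  # predecessors of d that are one hop closer to s
        if d in succ[u] and dist.get(u) == dist[d] - 1:
            paths += [p + [(u, d)] for p in shortest_paths(succ, dist, s, u)]
    return paths


def per_user_throughput(succ, capacity=10.0):
    """Mb/s per user when the busiest link just reaches full utilization."""
    nodes = list(succ)
    unit = 1.0 / (len(nodes) - 1)  # uniform traffic, one unit offered per user
    dists = {s: bfs_dist(succ, s) for s in nodes}
    pairs = [(s, d) for s in nodes for d in nodes if s != d]
    pairs.sort(key=lambda sd: dists[sd[0]][sd[1]])  # shortest requirements first
    flow = {(u, v): 0.0 for u in succ for v in succ[u]}
    for s, d in pairs:
        candidates = shortest_paths(succ, dists[s], s, d)
        # Assumed tie rule: least-loaded bottleneck; requirements are never split.
        best = min(candidates, key=lambda p: max(flow[link] for link in p))
        for link in best:
            flow[link] += unit
    return capacity / max(flow.values())


loop16 = per_user_throughput(ring(16))
bdl16 = per_user_throughput(ring(16, bidirectional=True))
```

For the 16-node unidirectional loop every link carries 8 units of flow per unit of offered traffic, so the sketch gives 1.25 Mb/s per user, in line with the Loop row of Table I. The bidirectional result depends on how ties among equal-length paths are broken, so it should be read as indicative only.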

The results of this investigation are presented in Table I. For each 
network, the average bit rate a user obtains in a network with 10- 
megabit-per-second transmission links, and the improvement this 
represents over a broadcast network, is presented. For the conven- 
tional broadcast and loop network, the fraction of the capacity a user 
obtains decreases linearly with the number of users, as expected. The 
loop system provides about twice as much throughput per user as the 
broadcast network because, on the average, a packet transmitted on 
this network uses only half of the network capacity. The two-connected 
networks obtain a factor of two increase in throughput because there 
is twice as much capacity emanating from each node, and an additional 
increase because the networks use a smaller fraction of the network 
capacity to transfer a packet between the sources and destinations. 
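The broadcast and loop rows for uniform traffic follow from a short calculation, sketched below under the assumptions that a broadcast channel is shared equally by all users and that a packet on a unidirectional loop crosses N/2 of the N links on average. The function names are hypothetical.

```python
def broadcast_per_user(n_users, link_mbps=10.0):
    """Every packet occupies the whole shared channel, so users split one link."""
    return link_mbps / n_users


def loop_per_user(n_users, link_mbps=10.0):
    """Uniform traffic on a unidirectional loop: each link carries N/2 units
    of flow per unit offered by each user, so the limit is link rate / (N/2)."""
    return link_mbps / (n_users / 2)


rates = {n: (broadcast_per_user(n), loop_per_user(n)) for n in (16, 32, 48, 64)}
```

For 16 users this gives 0.625 and 1.25 Mb/s, which Table I reports as 0.63 and 1.25, and the ratio of the two is 2 for every network size.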

The bidirectional loop, the Manhattan street network, and the 
hierarchical shuffle exchange respond well to the skewed requirements. 
These networks are capable of allowing complete connectivity while 
preventing users in different communities of interest from interfering 
with one another. This characteristic is extremely important in de- 
signing large networks. The shuffle-exchange and Manhattan street 
networks also respond well to a large group of users with uniform 
transmission requirements. This occurs because the average distance 
between users does not increase as rapidly in these networks as in the 
other networks. The average distance between users in the Manhattan
street network is greater than that in the shuffle-exchange network;
however, the throughput of the Manhattan street network is greater.
The Manhattan street network can support a larger throughput because
there are more equal-length shortest paths, and bottlenecks can
be avoided.


III. TOPOLOGY


In this section, three two-connected networks are described: the
bidirectional loop, the modified shuffle exchange, and the Manhattan
street network. These networks have two independent paths between
any pair of nodes, and they can survive a single link or node failure.
While these networks are not optimal, they show what measures can be used
to compare topologies, and what network characteristics are desirable.
Lower bounds on two measures, the average and maximum shortest
path between nodes, are derived and compared with these three
topologies.


3.1 Bidirectional loops 


In a bidirectional loop with N nodes, labeled 0 to N - 1, node i is
connected to nodes (i - 1) mod N and (i + 1) mod N. This is the only
two-connected network with bidirectional paths between all of the 
nodes. If the transmission protocols require a response each time a 
packet of data is transferred between two intermediate nodes on a 
path, this is the only possible two-connected network. This network 
was initially considered as a mechanism to make loop networks more 
reliable. 

This network has many of the topological advantages of loop systems. It

• Is defined for any number of nodes,

• Makes geographical sense,

• Has a simple rule for expanding the network by one node at a time, and

• Has a simple routing rule.

When a node is added to the network, the two existing nodes closest 
to this node are disconnected from one another and connected to the 
new node. Even if the network covers a large geographical area, there 
are not many long wires. Shortest-path routing in this network is 
straightforward. The nodes in the network are sequentially numbered 
from 0 to N - 1. The distance from a source node s to a destination 
node d is (d - s) mod N on the incremental path and (s - d) mod N 
on the decremental path. At the source, the shorter of these two paths 
is selected. Once a packet in this system starts on a path, it remains 
on that path. Therefore, a complete routing decision is made at the 
source, and the system is implemented as two separate loop systems. 
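
The routing rule above can be sketched in a few lines of Python. The function names are illustrative and do not come from the paper, and the tie-break toward the incremental path is an assumption, since the text does not say which path is chosen when both have the same length.

```python
def loop_route(s, d, n):
    """Return (direction, hops) for the shorter path from node s to node d."""
    inc = (d - s) % n   # hops on the incremental path
    dec = (s - d) % n   # hops on the decremental path
    # Assumed tie-break: prefer the incremental path when both are equal.
    return ("incremental", inc) if inc <= dec else ("decremental", dec)


def loop_average_distance(n):
    """Average shortest distance from one node to the other n - 1 nodes."""
    return sum(loop_route(0, d, n)[1] for d in range(1, n)) / (n - 1)
```

For a 16-node loop, `loop_route(1, 14, 16)` selects the decremental path with 3 hops, and the average distance is 64/15, a little under 4.27 hops.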

A disadvantage of this network is that the node addresses and the 
value of N change whenever a node is added to the system. Either an 
addressing and routing scheme that does not use this information 
must be found, or this information must be distributed each time the 
network is changed. Another disadvantage of this network is that the 
throughput is not as great as that in the shuffle-exchange and Man- 
hattan street networks. 


3.2 Modified shuffle exchange 


The modified shuffle-exchange network is based on the shuffle- 
exchange multistage switch. The network is defined for N nodes, 
where N is constrained to be a power of two. Node i is connected to 
nodes 2*i mod N and (2*i + 1) mod N. This results in self-loops at 
nodes 0 and N - 1, which are not used to transmit packets. They also 
make the network less reliable in that a single link failure can 
disconnect a node from the network. In the modified network, the self-loops 
are removed and nodes 0 and N - 1 are connected to one another, as 
shown in Fig. 3. When the shuffle-exchange network is part of a 
hierarchical structure, as in the previous section, the self-loops are 
replaced by connections to the higher-level network. 

Routing in this network is straightforward. Initially, ignore the two 
paths that were added to the modified network. Represent the address 
of node i by M = log_2 N bits, and label the paths to node 2*i mod N 
and node (2*i + 1) mod N as 0 and 1, respectively. When a packet is 
transmitted from node i, the address of the new node has the 
low-order M - 1 bits of node i's address in the high-order M - 1 bits. The 
low-order bit of the new address is 0 or 1, depending on the path 
selected. To find the shortest path between a source and destination, 
match as many of the high-order bits of the destination address with 
the low-order bits of the source address as possible. To get to the 
destination, shift the low-order bits of the destination address that are 
not included in this match into the address and determine the path 
that must be selected. 

For instance, assume that the source address is 11011 and the 
destination address is 11001. The first two bits of the destination 
address match the last two bits of the source address. The bits 001 
must be shifted into the address to get to the destination, and the 
distance to the destination is 3. To get to the destination, first path 0 
is taken to node 10110, then path 0 is taken to node 01100, and finally 
path 1 is taken to node 11001. 
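
The bit-matching rule and the example above can be sketched as follows. This is a minimal illustration, not the author's implementation: the function name and the list-of-hops return format are assumptions, and, as in the text, the two links added in the modified network are ignored.

```python
def shuffle_route(src, dst, m):
    """Return [(path_label, next_node), ...] for the shortest shift path.

    Addresses are m-bit integers; taking path b from node i leads to
    node (2*i + b) mod 2**m.
    """
    n = 1 << m
    if src == dst:
        return []
    # Largest k < m such that the low-order k bits of src equal the
    # high-order k bits of dst.
    overlap = 0
    for k in range(m - 1, 0, -1):
        if src & ((1 << k) - 1) == dst >> (m - k):
            overlap = k
            break
    hops = []
    node = src
    # Shift in the remaining low-order bits of dst, most significant first.
    for pos in range(m - overlap - 1, -1, -1):
        bit = (dst >> pos) & 1
        node = (2 * node + bit) % n
        hops.append((bit, node))
    return hops


route = shuffle_route(0b11011, 0b11001, 5)
print([(label, format(node, "05b")) for label, node in route])
```

For the example in the text, the route takes paths 0, 0, and 1 through nodes 10110, 01100, and 11001.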

If none of the high-order bits of the destination match the low-order 
bits of the source, then the distance to the destination is log_2 N steps. 
This is the maximum distance between any source and any destination. 
In Section 3.4, this will be shown to be the minimum maximum 
distance between nodes for a two-connected network of this size. 

Fig. 3—An 8-node modified shuffle-exchange network. 

The two paths that are added to the modified network make it 
possible to go from the all-one address to the all-zero address, and vice 
versa, in a single step. This shortens the average distance between 
nodes. These paths cannot change the maximum distance between 
nodes since this is already a minimum. The effect of these paths on 
the distance between nodes is shown in Table II. The network S-X is 
the shuffle exchange with two self-loops and MS-X is the modified 
network. It is evident that the additional paths do not provide a great 
decrease in the average minimum distance between nodes. They should 
be included to allow an alternate path when failures occur, but unless 
a simple routing rule is found to use them under normal operation, 
they should not be used. 

Table II—The average and maximum shortest distance between nodes in 
the shuffle-exchange network and the modified shuffle-exchange network 

              Average 
Nodes      S-X     MS-X     Maximum 

    4      1.50    1.33        2 
    8      2.11    1.96        3 
   16      2.83    2.73        4 
   32      3.65    3.58        5 
   64      4.53    4.49        6 
  128      5.46    5.44        7 
  256      6.42    6.40        8 
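
The small entries of Table II can be checked with a breadth-first search over the link structure defined above. The sketch below is illustrative only; the function names are not from the paper, and the average is taken over all ordered pairs of distinct nodes, which is an assumption that agrees with the 4-node and 8-node rows.

```python
from collections import deque


def shuffle_links(n, modified):
    """Outgoing links of each node; n must be a power of two."""
    links = {i: [(2 * i) % n, (2 * i + 1) % n] for i in range(n)}
    if modified:
        # Replace the self-loops at nodes 0 and n - 1 by links to each other.
        links[0] = [n - 1 if t == 0 else t for t in links[0]]
        links[n - 1] = [0 if t == n - 1 else t for t in links[n - 1]]
    return links


def distance_stats(links):
    """Return (average, maximum) shortest distance over all ordered pairs."""
    total = longest = pairs = 0
    for src in links:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in links[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        longest = max(longest, max(dist.values()))
        pairs += len(dist) - 1
    return total / pairs, longest
```

For 8 nodes this gives an average of 118/56 for S-X and 110/56 for MS-X, with a maximum of 3 in both cases.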

There are several problems with this type of a network. The first 
problem is that the physical layout of this network does not make 
sense geographically. If half of the nodes in the network are in one 
area and half of the nodes are in another remote area, then half of the 
connections must be between the two remote areas. Therefore, the 
network can only be used in a small area where the length of the 
interconnections does not make a difference. Shuffle-exchange networks 
in physically disjoint areas must be interconnected by a hierarchical 
network that can make sense geographically, as in the example in 
Section II. The second problem is that the network is only defined if 
the number of nodes equals 2^j. At present, no way has been found to 
add one node at a time, changing a small number of connections, 
and move from a network with 2^j nodes to a network with 2^(j+1) nodes. 
The third problem is that the alternate paths between a source and a 
destination are not good. If the preferred path is blocked or inoperable, 
the alternate paths are much longer. 


3.3 Manhattan street network 


The Manhattan street network is based on a grid of alternatingly 
directed streets and avenues, as shown in Fig. 1. The nodes exist on 
the corner of a street and an avenue. The rationale for this type of a 
network is that routing from a particular street and avenue to a 
destination should be straightforward. As in a city with this layout, 
any destination street and avenue can be found without asking direc- 
tions, even when some roads are blocked. In addition, it should be 
possible to lay out the network to make sense geographically. 
The principal difference between a grid connecting corners with 
streets and a grid connecting nodes with wires is that the physical 
constraints associated with a two-dimensional surface can be violated 
more easily with wires. For instance, in the example in Section II, the 
file servers and terminals forming a community of interest are in the 
same neighborhood. Assume that the file servers in this system are in 
the same room and that the personal computers in the same commu- 
nity of interest are in the same physical area. By connecting the file 
server to the region of the network with the terminals, rather than 
basing the connections strictly on the physical location of devices, the 
file server appears to be in the same neighborhood as the terminals. 
This reduces the interference between terminals in different commu- 
nities of interest. 

The difference in physical constraints also allows the extremes of 
the grid to be connected. These connections form the grid on the 
surface of a torus instead of a flat surface. The advantage of this cyclic 
surface is that there are no corners. Therefore, the maximum distance 
from a source to a destination is not the distance between two corners 
of the grid, but the distance between the center and one of the corners. 
The graph can also be flipped so that the links leaving the center node 
are always pointed in the same direction. This allows the same routing 
decision function to be used at every node. 

Consider a network with r rows and c columns. The current node 
has coordinates (i_s, j_s), and the destination node has coordinates 
(i_d, j_d). The current node is considered to be at location (0, 0), and the 
relative location of the destination, (i, j), is expressed as 

\[ i = \{[1 - 2(j_s \bmod 2)](i_d - i_s) + (r/2 - 1)\} \bmod r - (r/2 - 1) \]

\[ j = \{[1 - 2(i_s \bmod 2)](j_d - j_s) + (c/2 - 1)\} \bmod c - (c/2 - 1). \]

The current node is now in the center of the network. The value of i 
is between -(r/2 - 1) and r/2, and j is between -(c/2 - 1) and c/2. 
The factors 1 - 2*(j_s mod 2) and 1 - 2*(i_s mod 2) guarantee that the 
links leaving the current node point toward increasing i and j. The 
routing decision now depends only on the relative location of the 
destination and not on the current node. 
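
The relative-address computation can be written directly from the two expressions above. This is a sketch under stated assumptions: the function name is illustrative, r and c are taken to be even, and the offset terms are taken to be r/2 - 1 and c/2 - 1, which is what the stated ranges of i and j require.

```python
def relative_location(i_s, j_s, i_d, j_d, r, c):
    """Location of the destination relative to the current node.

    (i_s, j_s) is the current node, (i_d, j_d) the destination,
    and r and c are the (even) numbers of rows and columns.
    """
    i = ((1 - 2 * (j_s % 2)) * (i_d - i_s) + (r // 2 - 1)) % r - (r // 2 - 1)
    j = ((1 - 2 * (i_s % 2)) * (j_d - j_s) + (c // 2 - 1)) % c - (c // 2 - 1)
    return i, j
```

A node is always at (0, 0) relative to itself, and every destination maps to a distinct pair within the stated ranges, so one routing table indexed by (i, j) can serve every node.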

The routing preference from the central node to outlying nodes for 
a 12 × 12, 12 × 14, and 14 × 14 Manhattan street network is shown 
in Fig. 4. In this network, the two links emanating from the central 
node are directed upwards and to the right. The routing preference is 
the shortest distance from the central node to the destination when 
the link to the right is taken, minus the shortest distance to the 
destination when the upwards-directed link is taken. Therefore, a 
negative number implies that the right link leads to the shortest path 
to the destination, and a positive number implies that the upwards 
link yields the shortest path to the destination. The magnitude of the 
number shows how much longer the distance to the destination would 
be if a packet were forced to take the less desirable path. A zero 
implies that the distance to the destination is the same along either 
path. The figures show that to get to half of the nodes either path 
can be taken, to get to a quarter of the nodes the upwards path should 
be taken, and to get to the other quarter of the nodes the right path 
should be taken. The figures also show that if a packet is forced to 
take the wrong path, the increase in path length to the destination is 
never more than four. 

Fig. 4—Routing preference in a 12 × 12, 12 × 14, and 14 × 14 Manhattan street 
network. 

Fig. 5—Adding nodes E, F, G, and H one at a time to the basic rectangular structure 
consisting of nodes A, B, C, and D in a Manhattan street network. 
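
The routing preference defined above can be computed by two breadth-first searches, one from each neighbor of the central node. The sketch below assumes a particular orientation of the grid (even rows and columns point toward increasing index, odd ones toward decreasing index), so that node (0, 0) plays the role of the central node with one link "to the right" and one "upwards". The function names and this orientation are assumptions for illustration, not taken from the figures.

```python
from collections import deque


def msn_links(rows, cols):
    """Directed links of a rows x cols Manhattan street network on a torus."""
    links = {}
    for i in range(rows):
        for j in range(cols):
            row_step = 1 if i % 2 == 0 else -1   # direction of row i
            col_step = 1 if j % 2 == 0 else -1   # direction of column j
            links[(i, j)] = [(i, (j + row_step) % cols),
                             ((i + col_step) % rows, j)]
    return links


def distances_from(links, src):
    """Shortest hop count from src to every node."""
    dist = {src: 0}
    queue = deque([src])
    while queue:
        u = queue.popleft()
        for v in links[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def routing_preference(rows, cols):
    """Preference at node (0, 0) for every other destination.

    Negative: the right link is better; positive: the upwards link is
    better; zero: both links give the same distance.
    """
    links = msn_links(rows, cols)
    right, up = links[(0, 0)]
    via_right = distances_from(links, right)
    via_up = distances_from(links, up)
    return {d: via_right[d] - via_up[d] for d in links if d != (0, 0)}
```

The extra hop to either neighbor cancels in the difference, so the preference is simply the distance from the right neighbor minus the distance from the upwards neighbor.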

One problem with the shuffle-exchange network is the difficulty in 
changing the number of nodes in the network. Figures 5 and 6 show 
how nodes may be added to the Manhattan street network. Figure 5 
shows how two columns are added to the basic square structure within 
the Manhattan street network. The dotted lines show the links that 
will be broken when the next node is added. Figure 6 shows how the 
procedure is continued to add nodes to partially full columns. Each 
time a new node is added, two links are broken and connected to the 
new node. This is no greater than the number of links that must be 
broken in the bidirectional loop. Eventually this procedure leads to a 
network with two additional rows or columns, and the pattern of 
alternatingly directed rows and columns is preserved. 
Fig. 6—Adding a node K to a partially full column in a Manhattan street network. 


When adding a new node, both the physical position of the node and 
the topology of the network must be considered. It is desirable to 
connect the node to the nearest existing nodes, but it is also desirable 
to start as few new rows or columns as possible, and to keep the 
number of rows and columns equal. When rows and columns are kept 
approximately equal, the average and maximum shortest paths be- 
tween nodes increase as shown in Table III. 
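
The distance figures for small networks can be reproduced with a breadth-first search. The sketch below assumes the same grid orientation on a torus as described in the text (alternately directed rows and columns, with the extremes of the grid connected); the specific choice that even rows and columns point toward increasing index, and the function names, are assumptions made for illustration.

```python
from collections import deque


def msn_links(rows, cols):
    """Directed links of a rows x cols Manhattan street network on a torus."""
    links = {}
    for i in range(rows):
        for j in range(cols):
            row_step = 1 if i % 2 == 0 else -1
            col_step = 1 if j % 2 == 0 else -1
            links[(i, j)] = [(i, (j + row_step) % cols),
                             ((i + col_step) % rows, j)]
    return links


def msn_distance_stats(rows, cols):
    """Return (average, maximum) shortest path over all ordered node pairs."""
    links = msn_links(rows, cols)
    total = longest = 0
    for src in links:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in links[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        longest = max(longest, max(dist.values()))
    n = rows * cols
    return total / (n * (n - 1)), longest
```

Under these assumptions a 4 × 4 network has an average shortest path of 44/15 and a maximum of 5, and a 2 × 4 network has an average of 2 and a maximum of 3.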

Table III—Distances between nodes in 2i by 2j Manhattan street networks 

                              Shortest Paths 
                              Between Nodes 
Nodes    Rows    Columns    Average    Maximum 

    4      2        2         1.33        2 
    8      2        4         2.00        3 
   16      4        4         2.93        5 
   24      4        6         3.30        5 
   36      6        6         3.71        6 
   48      6        8         4.34        7 
   64      8        8         5.02        9 
   80      8       10         5.42        9 
  100     10       10         5.84       10 
  120     10       12         6.42       11 
  144     12       12         7.02       13 
  168     12       14         7.45       13 
  196     14       14         7.89       14 
  224     14       16         8.45       15 
  256     16       16         9.02       17 

In the shuffle-exchange network it is occasionally better to establish 
a hierarchy of networks rather than make a single network larger. 
Hierarchical structures are also useful in Manhattan street networks. 
They are used to 

• Decrease the number of paths between physically distant sections 
  of the network, 

• Eliminate long paths between communities of interest, and 

• Prevent traffic between communities of interest from affecting 
  communications in other communities of interest. 

The two-connected strategy can be maintained, as in Fig. 7. How- 
ever, this will make routing more complex. An alternative is to connect 
one or more of the nodes in a local area to a higher-level network, as 
shown in Fig. 8. By using this approach, routing decisions in a local 
area are not affected by network changes in other areas, and addresses 
in different local areas are assigned independently. A hierarchical 
addressing and routing structure, similar to that used in the telephone 
system, can be used. For example, the address within the local area 
corresponds to a phone number, and the address of the local network 
on the higher-level network corresponds to the area code. When 
sending a packet within the local area an area code is not required. 


3.4 An optimal two-connected network 


Certain characteristics of the “best” networks are difficult to quan- 
tify. For instance, it should be possible to add nodes without making 
major reconfigurations, create geographically dispersed networks with- 
out adding excessive numbers of long links, and establish communities 
of interest. Other characteristics, such as the average and maximum 
number of links between nodes, can be compared and bounded. 

Consider the class of two-connected networks. From a particular 
node, at most two nodes can be reached in one step, four additional 
nodes in two steps, and so on. The destination nodes form a binary 
tree. If, at any level in the tree, a destination node recurs, the number 
of new nodes that can be reached in future levels is reduced by the 
descendants of that node. Therefore, the maximum number of nodes 
that can be reached in m steps is 

\[ \sum_{i=1}^{m} 2^i = 2^{m+1} - 2. \]

If every node in a two-connected network can reach this number of 
new nodes in each m steps, then the network has the smallest average 
and maximum distance between nodes. In general, networks with 
these characteristics do not exist. However, this is a lower bound on 
these distance characteristics. 

Fig. 7—A hierarchical Manhattan street network in which all of the nodes are two- 
connected. 

In the shuffle-exchange network with 2^j nodes, each node must 
reach 2^j - 1 nodes, and the maximum minimum distance between 
nodes is j. The number of nodes that are reached in j - 1 steps in the 
optimum network is 2^j - 2. Therefore, if 2^j - 1 nodes must be reached 
on the optimum network, the minimum distance to the furthest node 
is j, and the largest minimum distance between nodes in the shuffle- 
exchange network is less than or equal to that in any network with 2^j 
nodes. 

[Figure caption fragment: "the hierarchical network are four"] 
Table IV—A comparison of the distances between nodes in several 
networks 

                       Number of Nodes 
Net     Distance      16       64      256 

Opt     Avg          2.53     4.19     6.06 
S-X     Avg          2.73     4.49     6.40 
MSN     Avg          2.93     5.02     9.02 
BDL     Avg          4.26    16.25    64.25 


A comparison of the distances between nodes 
for the optimum, shuffle-exchange, Manhattan street network, and 
the bidirectional loop is shown in Table IV. In both the optimum 
network and the shuffle-exchange network, the maximum distance 
between nodes varies as the log of the number of nodes. In the 
Manhattan street network, the maximum distance between nodes 
varies as the square root of the number of nodes, and, in the bidirec- 
tional loop, this distance varies linearly with the number of nodes. 
The same relationship is also noted between the number of nodes in 
these networks and the average distance between nodes. The average 
distance between nodes shows what fraction of the network resources 
is used to transfer a packet and provides an indication of the relative 
throughput of the networks. As shown in Section II, however, other 
factors also affect the throughput. 
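
The bound for the optimum network follows from the counting argument of this section: at most 2^k new nodes can be reached in exactly k steps, so the best case places the n - 1 destinations as close to the source as that limit allows. The sketch below computes the resulting lower bounds; the function name is illustrative and not from the paper.

```python
def optimum_bound(n):
    """Lower bounds (average, maximum) on the shortest distance between
    nodes in an n-node two-connected network."""
    remaining = n - 1      # destinations still to be placed
    total = 0              # sum of distances to all destinations
    k = 0                  # current distance from the source
    while remaining > 0:
        k += 1
        reached = min(2 ** k, remaining)   # at most 2**k new nodes at step k
        total += k * reached
        remaining -= reached
    return total / (n - 1), k


for n in (16, 64, 256):
    avg, longest = optimum_bound(n)
    print(n, round(avg, 2), longest)
```

For 16, 64, and 256 nodes the averages are 38/15, 264/63, and 1546/255, which round to 2.53, 4.19, and 6.06, and the maximum distances are 4, 6, and 8, the same log_2 N maximum as the shuffle-exchange network.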


IV. IMPLEMENTATION 


In a mesh network, as in a loop network, the communications lines 
are point-to-point links with a single transmitter and a single receiver. 
Transmission on these links is much simpler than in a broadcast 
network with a shared communication channel. The access protocols 
are simpler because there is only one source, and it is not necessary 
to multiplex users on the communication channel or resolve collisions. 
The receiver is simpler because the distance between the source and 
destination is constant, and the signal strength does not change by a 
large amount from packet to packet. Regenerating the signal to elim- 
inate distance constraints is simpler because signals only propagate in 
one direction. Timing recovery is simpler because the source can 
transmit continuously and bit synchronization does not have to be 
reestablished at the beginning of each packet. In addition, the com- 
munication line does not have many taps and is compatible with the 
current generation of fiber-optic equipment. 

In a two-connected network there are two links and a local source 
inputting data to a node, and two links and a local sink removing data 
from the node. Occasionally, multiple inputs try to transmit data to 
the same outgoing link. One way to resolve this problem is to queue 
packets waiting for a link. The network now assumes the complexity 
of a store-and-forward network. Not only must potentially large packet 
queues be maintained, but adaptive-routing, flow-control, deadlock- 
avoidance, and packet-resequencing issues must be addressed. 

In this section, the slotted-system and register-insertion techniques, 
developed for loop communication systems, will be extended to mesh 
networks with equal in and out degrees. The general strategy guaran- 
tees that every packet arriving on an incoming link, and not destined 
for the node, will be transmitted on one of the outgoing links. There- 
fore, it is not necessary to maintain a packet queue for the links 
emanating from the node. The requirement that packets passing 
through the node take one of the outgoing paths results in longer 
paths when the shortest path to the destination is busy. However, it 
is possible to design networks to reduce the effects of incorrect paths. 
For instance, in the Manhattan street network the path to only half 
of the destinations is increased if a packet is forced to take one path 
rather than the other. In addition, if a packet is forced to take a less 
desirable path, the distance to the destination is increased by at most 
four. 

The storage between the local source and the network is also limited. 
Packets from the local source are only transmitted when one of the 
outgoing links is not being used by an incoming link. It is assumed 
that either the local source can be throttled when the network is busy, 
or that the source provides data at a low rate relative to the network 
transmission rate. In the latter case, when a packet is lost, it must be 
recovered by a higher-level protocol. If the network delivers packets 
faster than the local sink can accept them, packets are either trans- 
mitted on one of the outgoing links or discarded. In the former case, 
the network is used for storage. Since new packets cannot enter the 
network when it is recirculating old packets, this transmission strategy 
acts as a flow-control mechanism. The assumptions on the local source 
and sink are implicit in all loop-configured systems without infinite 
storage. 

The packets of data in a slotted system are of fixed size. A node 
continuously transmits bits on each of the links emanating from the 
node, and periodically transmits a start-of-slot indication. The start- 
of-slot indication is followed by a packet of data or an empty slot. In 
the interval between the start-of-slot transmissions, at most one 
packet of data is received on each of the incoming links. The packets 
that are received between start-of-slot transmissions are forwarded 
after the start of slot is transmitted. These packets are switched to 
one of the outgoing links or the local sink before data from the local 
source are given access to the slot. Since there are the same number 
of links arriving and leaving from each node, and the local source can 
be throttled, a queue of packets will not accumulate. The operation of 
a slotted system without a packet queue is shown in Fig. 9. 

Fig. 9—Extension of slotted-loop systems to a mesh network. 
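
The slot-switching rule can be illustrated with a small sketch for a node with two incoming and two outgoing links. Everything here is a hypothetical model rather than the design in Fig. 9: packets are represented as (destination, preferred_output) pairs, the first arrival examined wins a contested link, and the local sink is assumed to accept every packet addressed to the node.

```python
def switch_slot(node, arrivals, local_packet=None):
    """Switch one slot at a two-connected node that keeps no packet queue.

    arrivals: at most two (destination, preferred_output) pairs, one per
    incoming link, with preferred_output equal to 0 or 1.
    Returns (outputs, delivered, local_sent).
    """
    outputs = [None, None]
    delivered = []
    deflected = []
    for packet in arrivals:
        dest, preferred = packet
        if dest == node:
            delivered.append(packet)          # handed to the local sink
        elif outputs[preferred] is None:
            outputs[preferred] = packet       # preferred link is free
        else:
            deflected.append(packet)          # preferred link already taken
    for packet in deflected:
        outputs[outputs.index(None)] = packet  # forced onto the other link
    local_sent = False
    # The local source gets a slot only if an outgoing link is still free.
    if local_packet is not None and None in outputs:
        preferred = local_packet[1]
        slot = preferred if outputs[preferred] is None else outputs.index(None)
        outputs[slot] = local_packet
        local_sent = True
    return outputs, delivered, local_sent
```

With two arrivals that both prefer link 0, one is deflected to link 1 and the local source is held off; when one arrival is addressed to the node itself, a link is left free and the local packet is admitted.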

The interface for a register-insertion loop is shown in Fig. 10. 
Packets in this system are variable in size, but constrained to be 
smaller than the storage register W_1. The local source is only allowed 
to transmit when register W_1 is empty. Since a packet from the local 
source is smaller than W_1, all data received from the loop while the 
local source is transmitting can be stored in W_1. When register W_1 is 
not empty, bits from this register are transmitted on the loop. 
Therefore, the length of this register remains the same when bits are 
being received, and decreases when bits are not received. As long as 
this register is not empty, gaps between arriving packets are removed. 

The register-insertion technique can be applied to a mesh network 
in which the in degree and out degree of the nodes are the same by 
making the node appear as if several loops are passing through it. This 
is shown in Fig. 11. Registers W_11 and W_22 correspond to register W_1 
for loops 1 and 2, respectively. In addition to the local sink, register 
W_12 appears to be a sink for loop 1. And, in addition to the local 
source, register W_12 appears to be a source of data for loop 2. Therefore, 
register W_12 allows messages on loop 1 to transfer to loop 2. As in a 
loop system, if buffer W_12 is full the packet must continue around loop 
1. Buffer W_21 serves the same purpose for packets crossing from loop 
2 to loop 1. 

Fig. 10—A register-insertion access unit in a loop system. 

Fig. 11—Extension of register-insertion systems to a mesh network. 

The register-insertion technique allows variable-length packets. 
When the system is busy, each node eliminates the null space between 
incoming packets to efficiently use the outgoing links. The 
slotted-system technique uses fixed-size packets. If there is less than a packet 
of data, the packet is partially empty. Therefore, the register-insertion 
technique uses the channel more efficiently. In a slotted system, the 
packets from the incoming links are aligned at the switching point. A 
packet only traverses a longer path if more than one incoming packet 
requires the same outgoing channel. In a register-insertion system, a 
packet can only transfer from one loop to the other if the crossover 
buffer is empty. It is possible that both packets on the through loops 
would rather be on the other loop, but cannot cross over because the 
buffers are full. Therefore, the register-insertion technique uses indi- 
vidual links more efficiently, but the slotted system takes a shorter 
path between the source and destination. The technique that provides 
the greater throughput depends on both the message-length distribu- 
tion and the network topology. 

A small amount of buffering can be included in either the slotted or 
register-insertion system to reduce the probability of a packet taking 
a longer path. In the slotted system, fixed-size packet buffers are 
inserted at the output channels. The probability that a packet must 
take a longer path is the probability that two arriving packets must 
take the same path and the buffer for that path is full. Without 
buffering, one packet must take the longer path whenever two arriving 
packets want to take the same path. In the register-insertion system, 
the additional storage is inserted at the crossover point. A packet 
cannot cross over if, when it is received, there are fewer bits available 
in the crossover buffer than there are in a maximum-size packet. In 
the original system, a packet cannot cross over if, when it arrives, the 
crossover buffer is not empty. Decreasing the probability that a packet 
takes a longer path decreases the fraction of the network resources 
that a packet uses, and increases the throughput of the system. The 
trade-off between buffering and system throughput remains to be 
investigated. 


V. FILE TRANSFER 


A file transfer consists of several packets being transmitted from a 
source to the same destination. In a system in which packets do not 
take the same path, it is possible that packets are not received in the 
same order that they are transmitted. Packets may be resequenced at 
the receiver; however, it is preferable to avoid this task. 

One possible solution to this problem is to transmit one packet at a 
time and wait for an acknowledgment. This reduces the file-transfer 
rate. However, because of the small delays at each node, this solution 
is not as bad in mesh networks as it is in store-and-forward networks. 
For instance, in a slotted system the average delay per node is half a 
slot time, and the average round trip delay equals the average number 
of hops between nodes, L, times the slot time. Therefore, there is an 
average of L slots between each packet in the file, and the file-transfer 
rate equals the channel rate divided by L + 1. Higher file-transfer 
rates can be achieved by end-to-end protocols that take advantage of 
the delay characteristics of the system, or node protocols that take 
advantage of specific hardware structure. 
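
The arithmetic behind the L + 1 figure can be spelled out as a short calculation. The function name and the 10 Mb/s channel rate in the usage note are illustrative assumptions; the half-slot delay per node and the L-hop path in each direction are taken from the text.

```python
def stop_and_wait_rate(channel_rate, mean_hops):
    """File-transfer rate when each packet waits for the acknowledgment
    of the previous one in a slotted system."""
    per_node_delay = 0.5                           # slots, average per node
    round_trip = 2 * mean_hops * per_node_delay    # slots; equals mean_hops
    # One slot carries the packet, then round_trip slots pass before the next.
    return channel_rate / (round_trip + 1)
```

For example, with L = 4 hops on a 10 Mb/s channel the transfer proceeds at one fifth of the channel rate, 2 Mb/s.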

Because of the small amount of delay at each node, it is unlikely 
that packets that take routes that have approximately the same length 
will arrive out of sequence. This probability can be reduced by allowing 
a small number of slots between packets in the same file transfer. A 
simple file-transfer protocol, which takes advantage of this character- 
istic, operates like the window protocols of store-and-forward networks 
and the go-back-N protocols of satellite systems. This protocol labels 
packets in a file transfer with a sequence number and a retry number. 
At the beginning of a file transfer, the transmitter and receiver start 
with the same sequence and retry number. The sequence number is 
the order of the packets. The receiver 

• Increments its retry number when a packet with the expected 
  retry number and a larger sequence number is received, 

• Sends a positive acknowledgment if a packet has a sequence number 
  less than or equal to the expected number, 

• Sends a negative acknowledgment with its retry number and 
  expected sequence number if the packet has a larger sequence 
  number than it expects, and 

• Commits a packet if it has the expected sequence number. 

The transmitter 

• Stops saving a packet for retransmission when it receives a 
  positive acknowledgment for a packet with a sequence number greater 
  than or equal to the expected number, 

• Adopts a new retry number and starts retransmitting from a 
  negatively acknowledged sequence number if a negative 
  acknowledgment with a larger retry number is received, and 

• Periodically retransmits the last packet in a file transfer until it 
  receives an acknowledgment. 

The transmitter initially transmits packets in the file transfer in 
every available slot. However, when it receives negative acknowledg- 
ments, it increases the number of slots between subsequent packets. 
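
The receiver rules listed above can be sketched as a small class. This is one possible reading of the rules, not the author's implementation: the class and message formats are hypothetical, the retry number is assumed to be incremented before the negative acknowledgment is built (so the first negative acknowledgment for a gap carries a retry number the transmitter has not seen), and a packet with the expected sequence number is committed whatever retry number it carries.

```python
class Receiver:
    """Receiver side of the sequence-number and retry-number protocol."""

    def __init__(self):
        self.expected_seq = 0
        self.retry = 0
        self.committed = []

    def receive(self, seq, retry, data):
        """Return ("ACK", seq) or ("NAK", retry_number, expected_seq)."""
        if seq > self.expected_seq:
            if retry == self.retry:
                # First gap seen under this retry number.
                self.retry += 1
            return ("NAK", self.retry, self.expected_seq)
        if seq == self.expected_seq:
            self.committed.append(data)   # in order: commit it
            self.expected_seq += 1
        return ("ACK", seq)               # duplicates are acknowledged too


rx = Receiver()
print(rx.receive(0, 0, "a"))   # in order
print(rx.receive(2, 0, "c"))   # gap: packet 1 is missing
print(rx.receive(3, 0, "d"))   # same gap, same retry number in the reply
print(rx.receive(1, 1, "b"))   # retransmission under the new retry number
```

In this sketch every out-of-order packet produces a negative acknowledgment, but only the first one changes the retry number, so a transmitter that backs up only on a larger retry number does so once per gap.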

Since this protocol only accepts packets in the correct order, packets 
do not have to be resequenced at the receiver. In a mesh network, 
several packets can be in transit between the source and destination. 
If the receiver misses a packet, it must send a negative acknowledg- 
ment for every packet with a larger sequence number than expected 
to be certain that the transmitter receives the negative acknowledgment. 
The retry number is included so that the transmitter only backs up 
and starts retransmitting when it receives the first negative acknowl- 
edgment to an outstanding packet. 
The transmitter adaptively changes the number of slots between 
packets according to network load and the rate of the receiver. When 
the network is lightly loaded, all packets follow the best path to the 
receiver, and arrive in sequence. If the receiver can accept packets at 
this rate, there are no negative acknowledgments, except for infrequent 
transmission errors, and the file-transfer rate equals the channel rate. 
When the network is heavily loaded, the packets follow different paths, 
are received out of sequence, and the file-transfer rate decreases. If 
the receiver cannot accept packets as fast as the transmitter can 
deliver them, the buffer in the interface unit is full when the packets 
arrive. In the systems described, these packets are directed to one of 
the output links at the node, and recirculate in the network until the 
buffer is available. These packets arrive out of sequence, negative 
acknowledgments are transmitted, and the transmitter slows down. 

Additional improvements are possible by taking the structure of the 
nodes into account. In a slotted system, subsequent packets in a file 
transfer can be marked. At a node, packets in a file transfer, which 
follow immediately behind one another, can be directed along the same 
path. The end-to-end protocol can be implemented with empty slots 
only occurring when the source cannot deliver packets quickly enough, 
or when the channel at the source node is busy with traffic passing 
through the node. This will improve the file-transfer rate on moder- 
ately used channels. If this modification is used, the transmitter and 
receiver must negotiate the number of packets in a continuous se- 
quence to be certain that the receiver can accept them at the channel 
rate. 

In a register-insertion system, if the loop paths define a Hamiltonian 
circuit, packets in a file transfer can be constrained to follow these 
loops. Since a packet can never be denied access to this loop, all 
packets follow the same path and will be received in sequence. This 
allows file transfers to occur at the channel rate without resequencing, 
but requires file transfers to use a larger fraction of the network 
resources. 

In both the register-insertion and slotted systems, it is possible to 
use a higher-level protocol to set up a limited number of virtual circuits 
along which file transfers can occur efficiently at the channel rate 
without resequencing. In the slotted system, the higher-level protocol 
is used to assign an input at a node to an output. When a file-transfer 
packet arrives at a node input, it is given first priority to the assigned 
output. Therefore, all packets in the file transfer follow the same path 
and do not have to be resequenced. The function of the higher-level 
protocol is to make the assigned paths for a file transfer use as few 
links as possible. The problem with this approach is that the preferred 
paths must be established at the beginning of each file transfer and 
disabled at the end of the transfer. In addition, a file transfer may be 
temporarily blocked by previously assigned paths, creating a need for 
a scheduler. In the register-insertion system, the preferred paths are 
established as the paths through the node. This has the same problems 
as the slotted system. 
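
The input-to-output assignment described above for the slotted system can be illustrated with a toy node model. The sketch below is not taken from the paper: the class name `SlottedNode`, the two-port layout, and the rule that a displaced packet takes any free output are assumptions made only for illustration.

```python
class SlottedNode:
    """Toy slotted node with equal numbers of inputs and outputs (hypothetical)."""

    def __init__(self, ports=(0, 1)):
        self.ports = tuple(ports)
        self.assigned = {}  # input port -> output port reserved by the higher-level protocol

    def assign(self, inp, out):
        """Reserve `out` for file-transfer packets arriving on `inp`."""
        if out in self.assigned.values():
            # A transfer may be blocked by earlier assignments;
            # a real system would hand this case to a scheduler.
            raise ValueError("output already assigned")
        self.assigned[inp] = out

    def release(self, inp):
        """Disable the preferred path at the end of the transfer."""
        self.assigned.pop(inp, None)

    def route(self, arrivals):
        """Map each arriving packet to an output for one slot.

        arrivals: {input port: (is_file_transfer, preferred output)}
        """
        free = list(self.ports)
        routed = {}
        # First priority: file-transfer packets on inputs with an assigned output.
        for inp, (is_ft, _) in arrivals.items():
            if is_ft and inp in self.assigned:
                routed[inp] = self.assigned[inp]
                free.remove(self.assigned[inp])
        # Remaining packets take their preferred output if it is still free,
        # otherwise any free output (assumed deflection rule).
        for inp, (_, preferred) in arrivals.items():
            if inp not in routed:
                out = preferred if preferred in free else free[0]
                routed[inp] = out
                free.remove(out)
        return routed
```

With input 0 assigned to output 1, a file-transfer packet on input 0 always leaves on output 1, so successive packets of the transfer follow one path; a competing packet on input 1 is moved to the other output.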


VI. CONCLUSION 


Mesh networks increase the throughput of conventional local area 
networks by decreasing the fraction of the network capacity needed to 
transmit information between a source and a destination. These 
networks have multiple paths between each source and destination, 
thus increasing the reliability of local networks. The networks consist 
of point-to-point links, and can be extended to cover a metropolitan, 
rather than a local, area. 

In general, mesh networks require complex store-and-forward nodes 
that also route messages, control the flow of data entering the network, 
resequence packets at the destination, and recover packets with errors. 
There are characteristics of the local or metropolitan area that allow 
these functions to be simplified. In the local environment, regular 
network topologies can be selected in which routing is straightforward. 
The lower error rates make it reasonable to recover errors on an end- 
to-end basis. This allows loop-access protocols to be extended to mesh 
networks, eliminating the need for buffering and additional flow- 
control protocols. Extensions for the slotted system and register- 
insertion techniques used in loop systems have been shown. Buffering 
can be included in these systems to improve the channel utilization; 
however, channels are less expensive in the local environment. The 
small node delays in these systems also make it reasonable to imple- 
ment file-transfer protocols that do not require packet resequencing. 

Three mesh networks have been studied, and the desirable charac- 
teristics of networks have been determined. Networks should have 
regular structures with simple routing rules, and should not have 
single points of failure. By minimizing the average and maximum 
distance between nodes, the fraction of the network resources used to 
transmit a packet decreases, and the throughput increases. This can 
be done by packing topologies with these characteristics noted, and by 
locating communities of terminals that communicate frequently in the 
same area of the network. Equal-length alternate paths between 
sources and destinations reduce the probability of bottlenecks and the 
need for buffering within a node. Networks will change, and it must 
be possible to add or delete nodes without changing a large number of 
connections. If the network covers a large area, it must be possible to 
limit the connections between nodes in different areas. Of the networks 
studied, the Manhattan street network has all of these characteristics. 

It is assumed that a data carrier signal is transmitted over a fading channel 
whose frequency response can be closely approximated over the transmission 
band by a first-order polynomial in frequency, the coefficients being slowly 
varying functions of time. An equivalent baseband model is obtained wherein 
the transmitted signal is of the form $s(t) = \sum_{k=-\infty}^{\infty} a_k f(t - kT)$, and the received 
signal is of the form $r(t) = s(t) + x(t)s'(t)$, where $x(t)$ is an unknown (e.g., 
random) function of time. The problem solved in this paper is that of finding 
the function $f$ of prescribed bandwidth that minimizes the mean-square error, 
$E\{|r(nT) - a_n|^2\}$, under the assumption that the $a_k$ are independent random 
variables of zero mean and unit variance, and $E\{|x(t)|^2\} = \alpha^2$. The results 
also apply to the sometimes more realistic hypothesis, $|x(t)| \le \alpha$. 


I. INTRODUCTION 


A common method of transmitting data $\{a_k\}$ and $\{b_k\}$ is via a carrier 
signal of the form 

$$s_c(t) = s_1(t)\cos\omega_c t + s_2(t)\sin\omega_c t, \tag{1}$$

where 

$$s_1(t) = \sum_{k=-\infty}^{\infty} a_k f(t - kT), \qquad s_2(t) = \sum_{k=-\infty}^{\infty} b_k f(t - kT). \tag{2}$$

In the usual mathematical model, $f(t) = (\sin\Omega t)/\Omega t$, $\Omega = \pi/T$, so that 
$s_1(nT) = a_n$; $s_2(nT) = b_n$. 


Assuming then that $s_c(t)$ is transmitted over an ideal noiseless channel, 
$s_1(t)$ and $s_2(t)$ can be recovered by synchronous demodulation of the 
received signal, and then the data obtained by sampling $s_1$ and $s_2$ at 
the times $nT$. 

A problem, communicated by G. Foschini, arises when $s_c(t)$ is 
transmitted over a channel having a relatively slow-varying fading 
characteristic. L. J. Greenstein and B. A. Czekaj have found that, in 
many cases, the fading channel response can be fairly well approximated 
over the transmission band by a first-order polynomial in 
frequency $\omega$, 

$$A_0(t) + A_1(t)\{\omega - \omega_c\} + B_1(t)\,i\{\omega - \omega_c\},$$

with slowly varying real coefficients.’ Then, to a good approximation, 
synchronous demodulation of the received signal gives, instead of $s_1(t)$, 

$$r_1(t) = A_0(t)s_1(t) + B_1(t)s_1'(t) + A_1(t)s_2'(t),$$

and a similar expression for the alteration of $s_2(t)$. 

We assume that $A_0(t)$, the center-frequency channel gain, is positive 
and can be determined at the receiver (e.g., by measuring the average 
power in a narrow band about the center frequency), so that, by the 
use of automatic gain control, we have available 

$$r_1(t) = s_1(t) + x_1(t)s_1'(t) + x_2(t)s_2'(t), \tag{3}$$

where $x_1(t)$ and $x_2(t)$ are unknown, for example, random, slowly 
varying functions of time. Then 

$$r_1(nT) = a_n + x_1(nT)s_1'(nT) + x_2(nT)s_2'(nT). \tag{4}$$


It has been suggested,” as an alternative to using adaptive channel 
compensation, that error-free reception may be obtained by doubling 
the bandwidth of $f(t)$ in (2), i.e., by taking $f(t) = (\sin\Omega t)^2/(\Omega t)^2$, 
$\Omega = \pi/T$, so that $s_1'(nT) = s_2'(nT) = 0$. Here we want to determine the best 
attainable trade-off between mean-square error and bandwidth under 
certain assumptions on $\{a_k\}$, $\{b_k\}$, $x_1(t)$, and $x_2(t)$. 

We have, from (2), 

$$r_1(nT) - a_n = a_n\{f(0) - 1\} + \sum_{k \ne n} a_k f(nT - kT) + x_1(nT)\sum_{k=-\infty}^{\infty} a_k f'(nT - kT) + x_2(nT)\sum_{k=-\infty}^{\infty} b_k f'(nT - kT). \tag{5}$$

Now we assume that the $a_k$ and $b_k$ are independent random variables 
of zero mean and unit variance. Then the expected mean-square error 
over $\{a_k\}$ and $\{b_k\}$ is 

$$E\{|r_1(nT) - a_n|^2\} = |1 - f(0)|^2 + \sum_{k \ne 0} |f(kT)|^2 + \{x_1^2(nT) + x_2^2(nT)\}\sum_{k=-\infty}^{\infty} |f'(kT)|^2 + 2x_1(nT)\sum_{k \ne 0} f'(kT)f(kT) + 2x_1(nT)[f(0) - 1]f'(0). \tag{6}$$

[Note that the cross product $x_1(nT)x_2(nT)$ does not enter here because 
of the assumptions on $\{a_k\}$ and $\{b_k\}$.] 

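Now let us suppose that $x_1(t)$ and $x_2(t)$ are random (continuous) 
functions satisfying 

$$E\{|x_1(t)|^2\} = \alpha_1^2, \qquad E\{x_1(t)\} = 0, \tag{7a}$$
$$E\{|x_2(t)|^2\} = \alpha_2^2. \tag{7b}$$

Then we have, for the expected, or mean-square error, $\epsilon^2$, 

$$\epsilon^2 = |1 - f(0)|^2 + \sum_{k \ne 0} |f(kT)|^2 + \alpha^2 \sum_{k=-\infty}^{\infty} |f'(kT)|^2, \tag{8}$$

where $\alpha^2 = \alpha_1^2 + \alpha_2^2$. 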
As an alternative to the statistical assumptions, (7a) and (7b), let 
us suppose that 

$$|x_1(t)| \le \alpha_1, \tag{9a}$$
$$|x_2(t)| \le \alpha_2. \tag{9b}$$

These assumptions may be more relevant in practice than the statistical 
assumptions. Note that if the coefficient of $x_1(nT)$ in (6) vanishes, 
which will be the case if $f(t)$ is even, then, under the assumptions (9a) 
and (9b), (8) will hold with the equality sign replaced by $\le$. We wish 
to minimize the quantity on the right in (8) over bandlimited functions 
$f$ of prescribed bandwidth. As we shall see, the minimum will be 
obtained for a real-valued even function, so that, indeed, the minimum 
will be an upper bound for $\epsilon^2$ under the assumptions (9a) and (9b). 

Note that, in the end, the quantity to be minimized depends only 
on how large $|x_1(t)|$ and $|x_2(t)|$ may be, slowly varying or not. The 
slowly varying hypothesis was used only to obtain (3) from the fading 
channel model. The same minimization problem is obtained, under 
the previous assumption on $\{a_k\}$, if 

$$s(t) = \sum_{k=-\infty}^{\infty} a_k f(t - kT) \tag{10}$$

is transmitted over a channel such that the received signal is simply 

$$r(t) = s(t) + x(t)s'(t), \tag{11}$$

where $x(t)$ is an unknown (continuous) function satisfying, if $x(t)$ is 
random, 

$$E\{|x(t)|^2\} = \alpha^2, \qquad E\{x(t)\} = 0, \tag{11a}$$

or (say) if not, then 

$$|x(t)| \le \alpha. \tag{11b}$$

It is convenient, and sufficient, to consider the case $T = 1$ in (8), so 
that the quantity of interest is 

$$\epsilon^2 = |1 - f(0)|^2 + \sum_{k \ne 0} |f(k)|^2 + \alpha^2 \sum_{k=-\infty}^{\infty} |f'(k)|^2. \tag{12}$$

This is to be minimized over functions $f$ in $B_2(\Omega)$; i.e., functions in $L_2$ 
of the form 

$$f(t) = \frac{1}{2\pi}\int_{-\Omega}^{\Omega} F(\omega)e^{i\omega t}\,d\omega. \tag{13}$$

We need only consider the case $0 < \Omega < 2\pi$, since for $\Omega \ge 2\pi$ we can 
make $\epsilon^2 = 0$ by taking $f(t) = (\sin \pi t)^2/(\pi t)^2$. The transmission system, 
of course, is useless if $\epsilon^2 \ge 1$, but we can always make $\epsilon^2 < 1$ by 
appropriate choice (or scaling) of $f$ ($f \equiv 0$ giving $\epsilon^2 = 1$). 

In general, we will not have $f(0) = 1$ for the optimal $f$. This is easily 
seen by defining the governing quantity in the problem, viz., 

$$\mu(\Omega; \alpha) = \inf_{\substack{f \in B_2(\Omega) \\ f(0) = 1}} \left\{ \sum_{k \ne 0} |f(k)|^2 + \alpha^2 \sum_{k=-\infty}^{\infty} |f'(k)|^2 \right\}. \tag{14}$$

It follows from this definition that 

$$\sum_{k \ne 0} |f(k)|^2 + \alpha^2 \sum_{k=-\infty}^{\infty} |f'(k)|^2 \ge \mu(\Omega; \alpha)\,|f(0)|^2, \qquad f \text{ in } B_2(\Omega). \tag{15}$$

Thus from (12) and (15) we have 

$$\epsilon^2 \ge |1 - f(0)|^2 + |f(0)|^2 \mu(\Omega; \alpha), \qquad f \text{ in } B_2(\Omega). \tag{16}$$

The quantity on the right is minimized by taking 

$$f(0) = \gamma = \gamma(\Omega; \alpha) = \{1 + \mu(\Omega; \alpha)\}^{-1}, \tag{17}$$

giving 

$$\epsilon^2 \ge \frac{\mu(\Omega; \alpha)}{1 + \mu(\Omega; \alpha)}. \tag{18}$$

Thus if the infimum in (14) is attained for $f = f(t; \Omega, \alpha)$, then equality 
will hold in (18) for the optimal function 

$$f_0(t; \Omega, \alpha) = \gamma f(t; \Omega, \alpha). \tag{19}$$

In cases of practical interest, $\mu$ will be small, and hence $f_0(0)$ will be 
slightly less than 1. 
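
The step from (16) to (17) and (18) is a one-variable minimization. As a worked check, write $y$ for a real value of $f(0)$ (an assumption made here for brevity; for a given modulus, a complex $f(0)$ can only increase $|1 - f(0)|^2$) and $\mu$ for $\mu(\Omega; \alpha)$:

```latex
\begin{align*}
\phi(y) &= (1 - y)^2 + \mu y^2, \\
\phi'(y) &= -2(1 - y) + 2\mu y = 0
  \;\Longrightarrow\; y = \gamma = \frac{1}{1 + \mu}, \\
\phi(\gamma) &= \frac{\mu^2}{(1 + \mu)^2} + \frac{\mu}{(1 + \mu)^2}
  = \frac{\mu}{1 + \mu}.
\end{align*}
```

Since $\phi''(y) = 2(1 + \mu) > 0$, the stationary point is the minimum, which is why the optimal $f_0(0)$ falls just below 1 when $\mu$ is small.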


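II. RESULTS 


The results are summarized in the following: 
Theorem: Define for $\alpha \ge 0$, $\Omega > 0$, 

$$\rho(\Omega; \alpha) = \frac{\pi}{\Omega}\,\frac{\alpha\Omega}{\arctan(\alpha\Omega)}, \tag{20}$$

where $\arctan(\cdot)$ is between 0 and $\pi/2$. Then in (14) we have, for $0 < \Omega \le \pi$, 

$$\mu(\Omega; \alpha) = \rho(\Omega; \alpha) - 1, \tag{21}$$

and for $\pi < \Omega \le 2\pi$, 

$$\mu(\Omega; \alpha) = \frac{\beta - \dfrac{\arctan(\alpha\beta\pi)}{\alpha\pi}}{1 - \beta + \dfrac{\arctan(\alpha\beta\pi)}{\alpha\pi}}, \tag{22}$$

where 

$$\beta = 2 - (\Omega/\pi). \tag{22a}$$

For $\Omega \ge 2\pi$, 

$$\mu(\Omega; \alpha) = 0. \tag{23}$$

Furthermore, for $0 < \Omega \le \pi$, the infimum in (14) is attained only for 

$$f(t) = f(t; \Omega, \alpha) = \frac{\rho(\Omega; \alpha)}{2\pi}\int_{-\Omega}^{\Omega} \frac{\cos\omega t}{1 + \alpha^2\omega^2}\,d\omega, \tag{24}$$

and for $\pi < \Omega < 2\pi$, $\alpha > 0$, only for 

$$f(t) = (1 - \lambda)f(t; \beta\pi, \alpha) + \lambda\left\{\varphi(t; \Omega)\cos\pi t - \frac{1}{\pi}\varphi'(t; \Omega)\sin\pi t\right\}, \tag{25}$$

where $f(t; \cdot, \alpha)$ is defined in (24), $\beta$ is defined in (22a), and 

$$\lambda = \frac{1 - \beta}{1 - \beta + \dfrac{\arctan(\alpha\beta\pi)}{\alpha\pi}}, \tag{25a}$$

$$\varphi(t; \Omega) = \frac{\sin(\Omega - \pi)t}{(\Omega - \pi)t}. \tag{25b}$$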

III. DISCUSSION 


In the simple fading channel model we have fixed the sampling 
interval $T$ to be unity, so that $\Omega = \pi$ corresponds to Nyquist-rate 
transmission, and solved the minimization problem for $0 < \Omega < 2\pi$. 
The case $0 < \Omega < \pi$ might be considered uninteresting, for in this case 
the bandwidth is too small for the data rate. However, the solution for 
this “uninteresting” case enters in the solution for the interesting 
case, $\pi < \Omega < 2\pi$. 

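From the results stated in the Theorem, the optimal function in 
$B_2(\Omega)$ for minimizing the mean-square error is found to be 

$$f_0(t; \Omega, \alpha) = \frac{1}{2\pi}\int_{-\Omega}^{\Omega} \frac{\cos\omega t}{1 + \alpha^2\omega^2}\,d\omega, \qquad 0 < \Omega \le \pi, \tag{26}$$

$$f_0(t; \Omega, \alpha) = f_0(t; 2\pi - \Omega, \alpha) + h_0(t; \Omega), \qquad \pi \le \Omega < 2\pi, \tag{27}$$

where 

$$h_0(t; \Omega) = \frac{\sin(\Omega - \pi)t}{\pi t}\cos\pi t - \frac{1}{\pi}\left\{\frac{d}{dt}\,\frac{\sin(\Omega - \pi)t}{\pi t}\right\}\sin\pi t$$
$$\phantom{h_0(t; \Omega)} = \frac{1}{2\pi}\int_{\Omega_1 \le |\omega| \le \Omega}\left(1 - \frac{|\omega|}{2\pi}\right)\cos\omega t\,d\omega, \qquad (\Omega_1 = 2\pi - \Omega). \tag{28}$$

The resulting minimum mean-square error is 

$$\epsilon_0^2(\Omega; \alpha) = 1 - \frac{\Omega}{\pi}\,\frac{\arctan(\alpha\Omega)}{\alpha\Omega}, \qquad 0 < \Omega \le \pi, \tag{29}$$

$$\epsilon_0^2(\Omega; \alpha) = \frac{2\pi - \Omega}{\pi}\left[1 - \frac{\arctan\alpha(2\pi - \Omega)}{\alpha(2\pi - \Omega)}\right], \qquad \pi \le \Omega \le 2\pi, \tag{30}$$

where $0 \le \arctan(\cdot) < \pi/2$. 
The main interest attaches to this quantity for $\pi < \Omega < 2\pi$ and $\alpha$ 
small, in which case 

$$\epsilon_0^2(\Omega; \alpha) \approx \frac{\alpha^2\pi^2}{3}\{2 - (\Omega/\pi)\}^3. \tag{31}$$

So there is an interesting trade-off between error and bandwidth in 
this case. 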
In the other direction we have 

$$\lim_{\alpha \to \infty} \epsilon_0^2(\Omega; \alpha) = 2 - (\Omega/\pi), \qquad \pi \le \Omega \le 2\pi. \tag{32}$$

Thus, if $\alpha$ is very large, $\Omega$ must be very near $2\pi$ in order for the error 
to be small; i.e., one may as well take $\Omega = 2\pi$ for very large $\alpha$. However, 
other practical considerations enter in the case of large $\alpha$; e.g., the 
necessities of very accurate sampling and very close approximation of 
the extremal function. Also, additive noise, which has been neglected 
in this analysis, will be magnified by automatic gain control during 
periods of deep fading. So the results are deemed of no practical value 
in the case of large $\alpha$. 
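
The two limiting statements, (31) for small $\alpha$ and (32) for large $\alpha$, can be checked numerically against the closed form (30). The following is a minimal sketch with $T = 1$; the function names are illustrative and do not appear in the paper.

```python
import math

def eps0_sq(omega, alpha):
    """Minimum mean-square error of eq. (30), valid for pi <= omega <= 2*pi (T = 1)."""
    beta = 2.0 - omega / math.pi
    x = alpha * beta * math.pi          # equals alpha * (2*pi - omega)
    if x == 0.0:
        return 0.0
    return beta * (1.0 - math.atan(x) / x)

def small_alpha_approx(omega, alpha):
    """Leading term for small alpha, eq. (31)."""
    return (alpha * math.pi) ** 2 / 3.0 * (2.0 - omega / math.pi) ** 3

def large_alpha_limit(omega):
    """Limit as alpha grows without bound, eq. (32)."""
    return 2.0 - omega / math.pi

omega = 1.5 * math.pi
for alpha in (0.01, 0.1, 1.0, 10.0, 1000.0):
    print(alpha, eps0_sq(omega, alpha),
          small_alpha_approx(omega, alpha), large_alpha_limit(omega))
```

For small `alpha` the first two printed columns agree closely; for large `alpha` the first column approaches the last.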
Notice that the extremal function for the case $\pi < \Omega < 2\pi$ has two 
distinct components, the low-frequency component being the extremal 
function for the frequency $(2\pi - \Omega)$. The other component, $h_0(t; \Omega)$, is 
a bandpass function that does not depend on $\alpha$, since, as may be seen 
from the first line in (28), its derivative vanishes at the integers. From 
the second line in (28), it is seen that $h_0(t; \Omega)$ is a bandpass version 
(center frequency $\pi$, upper frequency $\Omega$) of $(\sin \pi t)^2/(\pi t)^2$, the extremal 
function for $\Omega = 2\pi$. Graphs of $F_0(\omega; \Omega, \alpha)$, the Fourier transform of 
$f_0(t; \Omega, \alpha)$, are shown in Fig. 1 for several values of $\Omega$ and $\alpha^2$. 

It is doubtful that detailed statistics on fading channels, even if 
available, would be useful in practice, except to the extent that they 
could give a rough idea of what might be expected. Generally speaking, 
large errors cannot be tolerated over long intervals (negating, to some 
extent, the adoption of the mean-square error criterion), and one is 
faced with the dilemma of uncertainty in using such channels. In the 
model here, the “expected value of $\alpha^2$” might be regarded from a 
pragmatical viewpoint as a mathematical non sequitur which might 
better be replaced by a design hypothesis, $|x(t)| \le \alpha$. Then the use of 
$f_0(t; \Omega, \alpha)$ guarantees that the mean-square error will not exceed 
$\epsilon_0^2(\Omega; \alpha)$ if the hypothesis is true. The design problem is much like that 
of deciding how much insurance to buy. 

Fig. 1. The Fourier transform $F_0(\omega/\pi; \Omega, \alpha)$ of the extremal function for various 
values of $\Omega$ and $\alpha$. (a) $\Omega = 1.2\pi$, $\alpha^2 = 10$. (b) $\Omega = 1.5\pi$, $\alpha^2 = 1$. (c) $\Omega = 1.8\pi$, $\alpha^2 = 0.1$. 

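On the other hand, one might adopt a hedging strategy, where extra 
bandwidth is used to guard against fading, while zero error is obtained 
in the absence of fading by using a truly interpolatory $f$. That is, in 
the problem of minimizing 

$$\sum_{k \ne 0} |f(k)|^2 + \alpha^2 \sum_{k=-\infty}^{\infty} |f'(k)|^2 \tag{33}$$

over $f$ in $B_2(\Omega)$, $\pi \le \Omega \le 2\pi$, $f(0) = 1$, one decides to make the first 
sum zero and minimize the second sum under the constraints. The 
extremal function for this problem, then, depends only on $\Omega$, and is 
found to be† 

$$f_1(t; \Omega) = \frac{\sin\pi t}{\pi t}\left[\frac{\sin(\Omega - \pi)t}{\pi t} + \left(2 - \frac{\Omega}{\pi}\right)\cos(\Omega - \pi)t\right], \qquad (\pi \le \Omega \le 2\pi), \tag{34}$$

the minimum value of the second sum in (33) being, under the 
constraints, 

$$\sum_{k=-\infty}^{\infty} |f_1'(k)|^2 = \frac{\pi^2}{3}\left(2 - \frac{\Omega}{\pi}\right)^3, \qquad (\pi \le \Omega \le 2\pi). \tag{35}$$

The Fourier transform of $f_1$ is 

$$F_1(\omega; \Omega) = \begin{cases} 1 & \text{for } |\omega| < 2\pi - \Omega, \\ 1 - \dfrac{|\omega|}{2\pi} & \text{for } 2\pi - \Omega < |\omega| < \Omega, \\ 0 & \text{for } |\omega| > \Omega. \end{cases} \tag{36}$$

The graph of $F_1(\omega; \Omega)$ (see Fig. 2) suggests a log cabin; so it will be 
called the log-cabin characteristic, and $f_1(t; \Omega)$ will be called the log-cabin 
kernel. 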
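
The interpolatory property of the log-cabin kernel and the value (35) of the slope sum can be checked numerically. The sketch below takes the kernel in the product form $f_1(t; \Omega) = \frac{\sin\pi t}{\pi t}\left[\frac{\sin(\Omega - \pi)t}{\pi t} + (2 - \Omega/\pi)\cos(\Omega - \pi)t\right]$, which is our reading of (34); the function names and the truncation length are assumptions for illustration.

```python
import math

def log_cabin(t, omega):
    """Log-cabin kernel f_1(t; omega), pi <= omega <= 2*pi, in product form."""
    if t == 0:
        return 1.0
    a = omega - math.pi
    beta = 2.0 - omega / math.pi
    sinc = math.sin(math.pi * t) / (math.pi * t)
    return sinc * (math.sin(a * t) / (math.pi * t) + beta * math.cos(a * t))

def log_cabin_slope_at_integer(k, omega):
    """f_1'(k) at an integer k, using sinc(k) = 0 and sinc'(k) = (-1)^k / k for k != 0."""
    if k == 0:
        return 0.0                      # f_1 is even, so its slope at 0 vanishes
    a = omega - math.pi
    beta = 2.0 - omega / math.pi
    return ((-1) ** k / k) * (math.sin(a * k) / (math.pi * k) + beta * math.cos(a * k))

def slope_energy(omega, terms=200_000):
    """Truncated sum of |f_1'(k)|^2 over all integers k (terms decay like 1/k^2)."""
    return 2.0 * sum(log_cabin_slope_at_integer(k, omega) ** 2
                     for k in range(1, terms + 1))

omega = 1.5 * math.pi
print(slope_energy(omega), math.pi ** 2 / 3.0 * (2.0 - omega / math.pi) ** 3)
```

The two printed numbers should agree to several decimal places, and `log_cabin(k, omega)` is zero (to rounding) at every nonzero integer `k`.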

One may choose an optimum scaling of the log-cabin kernel as a 
substitute for $f_0(t; \Omega, \alpha)$ in the original problem of minimizing, over $f$ 
in $B_2(\Omega)$, $\pi \le \Omega \le 2\pi$, 

$$\epsilon^2(f) = |1 - f(0)|^2 + \sum_{k \ne 0} |f(k)|^2 + \alpha^2 \sum_{k=-\infty}^{\infty} |f'(k)|^2. \tag{37}$$

The resulting function is 

$$f_{10}(t; \Omega, \alpha) = \nu f_1(t; \Omega), \tag{38}$$

where 

$$\nu = \left\{1 + \frac{\alpha^2\pi^2}{3}\left(2 - \frac{\Omega}{\pi}\right)^3\right\}^{-1}.$$

(In case $\alpha$ is small, the optimum scaling may be ignored, as it only 
amounts to a second-order correction.) The optimally scaled log-cabin 
kernel gives for the mean-square error in the original problem, 

$$\epsilon_{10}^2(\Omega; \alpha) = \frac{\dfrac{\alpha^2\pi^2}{3}\left(2 - \dfrac{\Omega}{\pi}\right)^3}{1 + \dfrac{\alpha^2\pi^2}{3}\left(2 - \dfrac{\Omega}{\pi}\right)^3}, \qquad \pi \le \Omega \le 2\pi. \tag{39}$$

† It is no surprise that $f_1(t; \Omega) = \lim_{\alpha \to 0} f_0(t; \Omega, \alpha)$. 

Fig. 2. The log-cabin characteristic. 

In Fig. 3, $10\log_{10}\{\epsilon_0^2(\Omega; \alpha)\}$ and $10\log_{10}\{\epsilon_{10}^2(\Omega; \alpha)\}$ are plotted, for 
various values of $\alpha^2$, versus the normalized angular frequency, $\Omega/\pi$, 
which corresponds to $2WT$ for a top frequency $W$ and a sampling 
interval $T$. To evaluate the effectiveness of $f_0(t; \Omega, \alpha)$, some reference 
value of mean-square error must be adopted. It seems natural that 
$\epsilon_0^2(\Omega; \alpha)$ should be compared with $\epsilon_{10}^2(\pi; \alpha)$, the minimum mean-square 
error obtainable (at Nyquist rate) with an optimal scaling of $(\sin \pi t)/\pi t$. 
Thus, for $\alpha^2 = 1$, a 50-percent increase in bandwidth gives an 
improvement of approximately 6.3 dB. For small values of $\alpha^2$, the 
improvement in decibels is approximately $-30\log_{10}[2 - (\Omega/\pi)]$, which 
for $\Omega = 1.5\pi$ is approximately 9 dB; i.e., the mean-square error is 
reduced by a factor of 8 for a 50-percent increase in bandwidth. The 
difference between $\epsilon_0^2(\Omega; \alpha)$ and $\epsilon_{10}^2(\Omega; \alpha)$ is insignificant for moderate 
to small values of $\alpha^2$. Of course, the extremely small errors given for 
$\Omega$ near $2\pi$ have to be discounted in practice as being purely theoretical 
values. 
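
The decibel figures quoted above can be reproduced from the closed forms (29), (30), and (39). This is a minimal sketch with $T = 1$; the function names are illustrative only.

```python
import math

def eps0_sq(omega, alpha):
    """Minimum mean-square error with the optimal kernel, eqs. (29) and (30)."""
    band = omega if omega <= math.pi else 2.0 * math.pi - omega
    x = alpha * band
    ratio = math.atan(x) / x if x > 0.0 else 1.0
    if omega <= math.pi:
        return 1.0 - (omega / math.pi) * ratio
    return (band / math.pi) * (1.0 - ratio)

def eps10_sq(omega, alpha):
    """Mean-square error with the optimally scaled log-cabin kernel, eq. (39)."""
    m = (alpha * math.pi) ** 2 / 3.0 * (2.0 - omega / math.pi) ** 3
    return m / (1.0 + m)

def improvement_db(omega, alpha):
    """Gain over the Nyquist-rate reference eps10_sq(pi, alpha), in decibels."""
    return 10.0 * math.log10(eps10_sq(math.pi, alpha) / eps0_sq(omega, alpha))

print(improvement_db(1.5 * math.pi, 1.0))    # alpha^2 = 1, 50 percent extra bandwidth
print(improvement_db(1.5 * math.pi, 0.01))   # small alpha, same bandwidth
```

The first value comes out near 6.3 dB and the second near 9 dB, in line with the text.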


Fig. 3. Bandwidth-error exchange for various values of $\alpha^2$. The solid line is for the 
optimal kernel; the dotted line is for the log-cabin kernel. (Vertical axis: mean-square 
error in decibels. Horizontal axis: $2WT = \Omega/\pi$, from 1.0 to 2.0.) 


Whether or not the simple fading channel model is wholly acceptable, 
there is another reason for recommending or rationalizing the use 
of $f_0(t; \Omega, \alpha)$, or the log-cabin kernel: a reduced 
sensitivity to sampling jitter. We outline the justification of this 
reason. 


In the problem of sampling jitter, it is supposed that 

$$s(t) = \sum_{k=-\infty}^{\infty} a_k f(t - k), \tag{40}$$

where $f$ is a real-valued function in $B_2(\Omega)$ and the $a_k$ are independent 
random variables of zero mean and unit variance. In the absence of 
sampling jitter, $f$ is taken, in case $\Omega = \pi$, to be $(\sin \pi t)/\pi t$, so that $s(t)$ 
is sampled at time(s) $t = n$ to obtain $a_n = s(n)$. In the presence of 
sampling jitter, $s(t)$ is inadvertently sampled at time(s) $t = n + \tau$, 
where $\tau$ is assumed to be a random variable with density $p(\tau)$, usually 
symmetric about $\tau = 0$. This jitter results in an error 

$$\epsilon_n = a_n\{1 - f(\tau)\} - \sum_{k \ne n} a_k f(\tau + n - k). \tag{41}$$

The expected mean-square error over the $\{a_k\}$ is 

$$\epsilon^2(\tau) = \{1 - f(\tau)\}^2 + \sum_{k \ne 0} f^2(k + \tau) = 1 - 2f(\tau) + \sum_{k=-\infty}^{\infty} f^2(k + \tau), \tag{42}$$

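which, when averaged over $\tau$, gives 

$$\epsilon^2 = 1 - 2\int_{-\infty}^{\infty} f(\tau)p(\tau)\,d\tau + \int_{-\infty}^{\infty}\left\{\sum_{k=-\infty}^{\infty} f^2(k + \tau)\right\}p(\tau)\,d\tau. \tag{43}$$

In case $0 < \Omega \le \pi$, the sum in the last integral is $\int_{-\infty}^{\infty} f^2(t)\,dt$, 
independent of $\tau$. In this case, 

$$\tilde{\epsilon}^2 = 1 - 2\int_{-\infty}^{\infty} f(\tau)p(\tau)\,d\tau + \int_{-\infty}^{\infty} f^2(t)\,dt. \tag{44}$$

It is easy to show that $\tilde{\epsilon}^2$ in (44), which is the same as $\epsilon^2$ in (43) only 
for $0 < \Omega \le \pi$, is minimized over $f$ in $B_2(\Omega)$ by taking 

$$f(t) = \int_{-\infty}^{\infty} p(\tau)\,\frac{\sin\Omega(t - \tau)}{\pi(t - \tau)}\,d\tau. \tag{45}$$

That is, $f(t)$ is obtained by bandlimiting $p(t)$ [projecting $p(t)$ 
on $B_2(\Omega)$]. 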
Recall (26), where 

$$f_0(t; \Omega, \alpha) = \frac{1}{2\pi}\int_{-\Omega}^{\Omega} \frac{\cos\omega t}{1 + \alpha^2\omega^2}\,d\omega, \qquad 0 < \Omega \le \pi.$$

From this, it is seen that 

$$f_0(t; \Omega, \alpha) = \int_{-\infty}^{\infty} \frac{e^{-|\tau|/\alpha}}{2\alpha}\,\frac{\sin\Omega(t - \tau)}{\pi(t - \tau)}\,d\tau, \qquad 0 < \Omega \le \pi. \tag{46}$$

Thus, for $0 < \Omega \le \pi$, $f_0(t; \Omega, \alpha)$ may be interpreted as the function in 
$B_2(\Omega)$ which minimizes the mean-square error due to (reduced bandwidth 
and) a sampling jitter $\tau$ with density $(2\alpha)^{-1}e^{-|\tau|/\alpha}$, $-\infty < \tau < \infty$. 
The fact that $\tau$ is unbounded requires the assumption of an ensemble 
of sampling mechanisms. However, this is not important in the case 
of small $\alpha$, when all that really matters is the second moment of $p(\tau)$. 
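
The link between (26) and (46) rests on the fact that the two-sided exponential density $(2\alpha)^{-1}e^{-|\tau|/\alpha}$ has Fourier transform $1/(1 + \alpha^2\omega^2)$. The following sketch checks this by a simple trapezoidal rule; the truncation width and step count are arbitrary choices made for illustration.

```python
import math

def laplace_transform_numeric(omega, alpha, half_width=40.0, steps=40_000):
    """Trapezoidal estimate of the Fourier transform of exp(-|tau|/alpha)/(2*alpha).

    The density is even, so the transform equals
    (1/alpha) * integral over [0, L] of exp(-tau/alpha) * cos(omega*tau) d tau,
    truncated here at L = half_width * alpha.
    """
    upper = half_width * alpha
    h = upper / steps
    total = 0.0
    for i in range(steps + 1):
        tau = i * h
        weight = 0.5 if i in (0, steps) else 1.0   # trapezoid end-point weights
        total += weight * math.exp(-tau / alpha) * math.cos(omega * tau)
    return total * h / alpha

def laplace_transform_exact(omega, alpha):
    """Closed form 1 / (1 + alpha^2 * omega^2), the integrand weight in eq. (26)."""
    return 1.0 / (1.0 + (alpha * omega) ** 2)

print(laplace_transform_numeric(2.0, 0.3), laplace_transform_exact(2.0, 0.3))
```

Bandlimiting this density to $[-\Omega, \Omega]$, as (45) prescribes, therefore gives exactly the integral in (26).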

The mean-square error due to sampling jitter can be decreased at 
the expense of extra bandwidth. For $\Omega > \pi$, the sum $\sum_{k=-\infty}^{\infty} f^2(k + \tau)$ is 
no longer, because of aliasing, the same as $\int_{-\infty}^{\infty} f^2(t)\,dt$. The case 
$\pi < \Omega < 2\pi$ can be treated, as in the proof of the Theorem, by decomposing 
$f$ as 

$$f(t) = g(t) + h(t),$$

where $g$ is the low-frequency component, bandlimited to $[-(2\pi - \Omega), 
(2\pi - \Omega)]$, and $h$ is the bandpass component with center frequency $\pi$ 
and upper frequency $\Omega$. It turns out that the low-frequency component 
of the optimal $f$ is obtained by bandlimiting the density $p(\tau)$. So in 
case of the two-sided exponential density, $g(t) = f_0(t; 2\pi - \Omega, \alpha)$ is 
optimal, as in $f_0(t; \Omega, \alpha)$. The optimal bandpass component $h(t)$ is not 
quite the same as $h_0(t; \Omega)$ in (28), but is very close to $h_0(t; \Omega)$ in case 
the density is symmetric with small support $[-\alpha, \alpha]$. In fact, as $\alpha \to 0$, 
the optimal $f$ in $B_2(\Omega)$, $\pi < \Omega < 2\pi$, for the (symmetric) sampling-jitter 
problem tends to $f_1(t; \Omega)$, the log-cabin kernel. So the log-cabin kernel 
is near optimal for reducing, at the expense of bandwidth, the mean-square 
error due to small symmetrically distributed sampling jitter. 

An obvious generalization of the problem considered here is the 
minimization over $f$ in $B_2(\Omega)$ of 

$$\epsilon^2(f) = |1 - f(0)|^2 + \sum_{k \ne 0} |f(k)|^2 + \sum_{n=1}^{N} \alpha_n \left\{\sum_{k=-\infty}^{\infty} |f^{(n)}(k)|^2\right\}. \tag{47}$$

In response to a question raised by the Referee, a numerical approach 
to this problem is not recommended, at least for moderate $N$, because, 
first, the analytical solution is tractable, simpler, and exact; and 
second, certain anomalies can be anticipated. 

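For $0 < \Omega \le \pi$, the solution is almost trivial, the extremal function 
being 

$$f_0(t; \Omega, \boldsymbol{\alpha}_N) = \frac{1}{2\pi}\int_{-\Omega}^{\Omega} \frac{\cos\omega t}{P(\omega^2; \boldsymbol{\alpha}_N)}\,d\omega, \qquad (0 < \Omega \le \pi), \tag{48}$$

where 

$$P(\omega^2; \boldsymbol{\alpha}_N) = 1 + \sum_{n=1}^{N} \alpha_n \omega^{2n}.$$

Always (for any $\Omega$), the minimum mean-square error is 

$$\epsilon_0^2(\Omega; \boldsymbol{\alpha}_N) = 1 - f_0(0; \Omega, \boldsymbol{\alpha}_N). \tag{49}$$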
Clearly, $\epsilon^2(f)$ can be made zero for $\Omega \ge (N + 1)\pi$ by taking $f(t) = 
(\sin \pi t)^{N+1}/(\pi t)^{N+1}$. For $\pi < \Omega < (N + 1)\pi$, aliasing complicates the 
problem. However, the aliasing can be handled in a systematic way, 
and the solution, if tedious, is straightforward, except in anomalous 
cases where only even-order derivatives appear in the last sum in (47). 
In the anomalous cases, the minimum, or properly speaking, the 
infimum of $\epsilon^2(f)$ over $f$ in $B_2(\Omega)$ is not attainable for $\pi < \Omega < 
(N + 1)\pi$. For example, 

$$\epsilon^2(f) = |1 - f(0)|^2 + \sum_{k \ne 0} |f(k)|^2 + \alpha_2 \sum_{k=-\infty}^{\infty} |f''(k)|^2$$

can be made arbitrarily small for $f$ in $B_2(2\pi)$, but only at the expense 
of making the norm (in $L_2$) of $f$ very large. 

There will be near-anomalous cases where the infimum is attained 
but for an $f$ of large norm, i.e., cases where the solution is not 
satisfactory for reasons not taken into account. One consideration is 
the average power of the transmitted signal, which is directly proportional 
to $\int_{-\infty}^{\infty} f^2(t)\,dt$. Another consideration is the sensitivity to 
sampling jitter. 

It is obvious, then, in the general problem, that $\epsilon^2(f)$ should be 
minimized with a constraint on the norm of $f$. This severely complicates 
the problem. The practical alternative is to solve the problem 
and then appraise the solution. What is needed is a better understanding 
of how the alphas affect the norm of the solution for $\Omega > \pi$. 
It turns out for the simple problem considered here that the norm of 
$f_0(t; \Omega, \alpha)$ decreases with $\Omega$ for $\pi < \Omega < 2\pi$. It would be nice to have, 
at least, sufficient conditions on the alphas for this decrease to obtain 
in the general problem. On the basis of the solution (48) to the general 
problem for $\Omega \le \pi$, and the analogous sampling-jitter problem, it is 
conjectured that a sufficient condition on the alphas is that the 
reciprocal of the polynomial $P(\omega^2; \boldsymbol{\alpha}_N)$ in (48) be the Fourier transform 
of a positive function. 


IV. PROOF OF THE THEOREM 


The minimization problem is obviously equivalent to the problem 
of minimizing over $f$ in $B_2(\Omega)$, 

$$Q(f; \alpha) = \sum_{k=-\infty}^{\infty} |f(k)|^2 + \alpha^2 \sum_{k=-\infty}^{\infty} |f'(k)|^2, \tag{50}$$

subject to $f(0) = 1$. 
Now suppose $f = f_1 + if_2$, where $f_1$ and $f_2$ are real-valued functions 
in $B_2(\Omega)$ with $f_1(0) = 1$, $f_2(0) = 0$. Then, obviously, 

$$Q(f; \alpha) = Q(f_1; \alpha) + Q(f_2; \alpha). \tag{51}$$

So we may restrict our attention to real-valued $f$ in $B_2(\Omega)$. Next suppose 
that $f = f_1 + f_2$, where $f_1$ and $f_2$ are real-valued functions in $B_2(\Omega)$, $f_1$ 
being even, and $f_2$ odd. Once again we obtain (51), since $f_1$ and $f_2$ 
($f_1'$ and $f_2'$) are orthogonal over the integers. So we may restrict our 
attention to real-valued even functions $f$ in $B_2(\Omega)$, $f(0) = 1$, in seeking 
the minimum. 

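Now define 

$$Q^*(f; \alpha) = \int_{-\infty}^{\infty} |f(t)|^2\,dt + \alpha^2 \int_{-\infty}^{\infty} |f'(t)|^2\,dt. \tag{52}$$

Then we have 
Lemma 1. For $f$ in $B_2(\Omega)$, $0 < \Omega \le \pi$, we have 

$$Q(f; \alpha) = Q^*(f; \alpha). \tag{53}$$

This follows from the well-known result, 

$$\tau \sum_{k=-\infty}^{\infty} |f(k\tau + \theta)|^2 = \int_{-\infty}^{\infty} |f(t)|^2\,dt, \qquad f \text{ in } B_2(\Omega),\ \theta \text{ real},\ 0 < \tau \le \pi/\Omega, \tag{54}$$

and the fact that if $f$ belongs to $B_2(\Omega)$ then so does $f'$. □ 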
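Next define 

$$\rho(\Omega; \alpha) = \inf_{\substack{f \in B_2(\Omega) \\ f(0) = 1}} \{Q^*(f; \alpha)\}, \qquad 0 < \Omega < \infty,\ 0 \le \alpha < \infty. \tag{55}$$

Then it is a simple matter to establish: 
Lemma 2. We have 

$$\rho(\Omega; \alpha) = \frac{\pi}{\Omega}\,\frac{\alpha\Omega}{\arctan(\alpha\Omega)}, \tag{56}$$

and the infimum in (55) is attained for, and only for, 

$$f(t) = f(t; \Omega, \alpha) = \rho(\Omega; \alpha)\,\frac{1}{2\pi}\int_{-\Omega}^{\Omega} \frac{\cos\omega t}{1 + \alpha^2\omega^2}\,d\omega. \tag{57}$$

It follows from Lemmas 1 and 2, and the definition (14) of $\mu(\Omega; \alpha)$, 
that 

$$\mu(\Omega; \alpha) = \rho(\Omega; \alpha) - 1, \qquad 0 < \Omega \le \pi. \tag{58}$$

So Lemmas 1 and 2 establish the first part of the theorem. 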


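Proof of Lemma 2. From the previous argument [which applies as well 
to the integrals in (52)], the extremal $f$ will be real and even; i.e., 

$$f(t) = \frac{1}{2\pi}\int_{-\Omega}^{\Omega} F(\omega)\cos\omega t\,d\omega, \qquad f(0) = 1, \tag{59}$$

where $F(\omega)$ is real and even. Then 

$$Q^*(f; \alpha) = \frac{1}{2\pi}\int_{-\Omega}^{\Omega} F^2(\omega)(1 + \alpha^2\omega^2)\,d\omega. \tag{60}$$

Now suppose $g$ is any real even function in $B_2(\Omega)$ satisfying $g(0) = 0$; i.e., 

$$g(t) = \frac{1}{2\pi}\int_{-\Omega}^{\Omega} G(\omega)\cos\omega t\,d\omega, \qquad \int_{-\Omega}^{\Omega} G(\omega)\,d\omega = 0. \tag{61}$$

Then 

$$Q^*(f + g; \alpha) = \frac{1}{2\pi}\int_{-\Omega}^{\Omega} [F(\omega) + G(\omega)]^2(1 + \alpha^2\omega^2)\,d\omega = Q^*(f; \alpha) + Q^*(g; \alpha) + \frac{1}{\pi}\int_{-\Omega}^{\Omega} F(\omega)G(\omega)(1 + \alpha^2\omega^2)\,d\omega. \tag{62}$$

The last integral vanishes, in accord with (61), if 

$$F(\omega) = \frac{c}{1 + \alpha^2\omega^2}. \tag{63}$$

Then setting $f + g = f_1$, we have $f_1$ in $B_2(\Omega)$, $f_1(0) = 1$, and 

$$Q^*(f_1; \alpha) = Q^*(f; \alpha) + Q^*(f_1 - f; \alpha) \ge Q^*(f; \alpha), \tag{64}$$

with equality throughout if, and only if, $f_1 - f \equiv 0$. 
So the restriction to $[-\Omega, \Omega]$ of $F(\omega)$ given by (63) is the Fourier 
transform of the extremal function, provided 

$$\frac{c}{2\pi}\int_{-\Omega}^{\Omega} \frac{d\omega}{1 + \alpha^2\omega^2} = 1, \tag{65}$$

requiring 

$$c = \frac{\pi}{\Omega}\,\frac{\alpha\Omega}{\arctan(\alpha\Omega)}. \tag{66}$$

Then 

$$\rho(\Omega; \alpha) = \frac{1}{2\pi}\int_{-\Omega}^{\Omega} F^2(\omega)(1 + \alpha^2\omega^2)\,d\omega = \frac{c^2}{2\pi}\int_{-\Omega}^{\Omega} \frac{d\omega}{1 + \alpha^2\omega^2} = c. \qquad \square \tag{67}$$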
Now to find the extremal $f$ in $B_2(\Omega)$, $\pi < \Omega < 2\pi$, we first introduce 
$B_2(a, b)$, the class of square-integrable bandlimited functions whose 
Fourier transforms vanish outside the intervals $[-b, -a]$, $[a, b]$, where 
$0 \le a < b < \infty$. (In case $a > 0$, the functions are called bandpass 
functions.) For $f$ in $B_2(\Omega)$, $\pi < \Omega < 2\pi$, we may make the decomposition, 

$$f(t) = g(t) + h(t), \qquad g \text{ in } B_2(\beta\pi),\ h \text{ in } B_2(\beta\pi, \Omega), \tag{68}$$

where $\beta = 2 - (\Omega/\pi)$, $0 < \beta < 1$. Next we use the general representation 
for $h$ in $B_2(\beta\pi, \Omega)$, 

$$h(t) = p(t)\cos\pi t + q(t)\sin\pi t, \qquad p, q \text{ in } B_2(\beta'\pi), \tag{69}$$

where $\beta' = 1 - \beta > 0$. 
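In $Q(f; \alpha)$, we need the values of $f(k)$ and $f'(k)$, which with the 
decomposition (68) become 

$$f(k) = g(k) + (-1)^k p(k), \tag{70}$$
$$f'(k) = g'(k) + (-1)^k[p'(k) + \pi q(k)]. \tag{71}$$

Now it is convenient at this point to use the fact that the extremal $f$ 
will be real, allowing us to write 

$$\begin{aligned} Q(f; \alpha) &= \sum_k [g(k) + (-1)^k p(k)]^2 + \alpha^2 \sum_k \{g'(k) + (-1)^k[p'(k) + \pi q(k)]\}^2 \\ &= \sum_k g^2(k) + \alpha^2 \sum_k [g'(k)]^2 + \sum_k p^2(k) + \alpha^2 \sum_k [p'(k) + \pi q(k)]^2 \\ &\quad + 2\sum_k (-1)^k g(k)p(k) + 2\alpha^2 \sum_k (-1)^k g'(k)[p'(k) + \pi q(k)]. \end{aligned} \tag{72}$$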


Now recall that $g$ and $g'$ belong to $B_2(\beta\pi)$; $p$ and $[p' + \pi q]$ belong 
to $B_2(\beta'\pi)$, where $0 < \beta < 1$ and $\beta' = 1 - \beta$. Therefore, $gp$ and 
$g'[p' + \pi q]$ belong to $B_1(\pi)$, the class of absolutely integrable ($L_1$) 
functions whose Fourier transforms vanish outside $[-\pi, \pi]$. Hence, 
their Fourier transforms vanish at the endpoints, $\pm\pi$, since the Fourier 
transform of a function of $L_1$ is continuous. It follows, for example, 
from the Poisson sum formula, that the last two sums in (72) vanish, 
and the other sums may be written as integrals, in accord with (54). 
Thus 

$$Q(f; \alpha) = \int_{-\infty}^{\infty} g^2(t)\,dt + \alpha^2 \int_{-\infty}^{\infty} [g'(t)]^2\,dt + \int_{-\infty}^{\infty} p^2(t)\,dt + \alpha^2 \int_{-\infty}^{\infty} [p'(t) + \pi q(t)]^2\,dt, \qquad g \text{ in } B_2(\beta\pi),\ p, q \text{ in } B_2(\beta'\pi). \tag{73}$$
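We wish to minimize this quantity subject to 

$$f(0) = p(0) + g(0) = 1.$$

First we make the last integral vanish by taking 

$$q(t) = -\frac{1}{\pi}\,p'(t), \tag{74}$$

noting that this is not necessary in case $\alpha = 0$. Then 

$$Q(f; \alpha) = Q^*(g; \alpha) + \int_{-\infty}^{\infty} p^2(t)\,dt, \qquad g \text{ in } B_2(\beta\pi),\ p \text{ in } B_2(\beta'\pi). \tag{75}$$

Now suppose 

$$p(0) = \lambda \quad \text{and} \quad g(0) = 1 - \lambda. \tag{76}$$

Since 

$$p(0) = \int_{-\infty}^{\infty} p(t)\,\frac{\sin\beta'\pi t}{\pi t}\,dt, \tag{77}$$

we have, from Schwarz’s inequality, 

$$\int_{-\infty}^{\infty} p^2(t)\,dt \ge p^2(0)/\beta', \tag{78}$$

with equality holding if, and only if, 

$$p(t) = p(0)\,\frac{\sin\beta'\pi t}{\beta'\pi t}, \qquad (\beta'\pi = \Omega - \pi). \tag{79}$$

From Lemma 2, we have 

$$Q^*(g; \alpha) \ge g^2(0)\,\rho(\beta\pi; \alpha), \tag{80}$$

with equality holding if, and only if, 

$$g(t) = g(0)\,f(t; \beta\pi, \alpha). \tag{81}$$

Therefore 

$$Q(f; \alpha) \ge \min_{\lambda}\,\{(1 - \lambda)^2\rho(\beta\pi; \alpha) + \lambda^2/\beta'\}, \tag{82}$$

where $\beta' = 1 - \beta > 0$. 
The minimum occurs for 

$$\lambda = \frac{1 - \beta}{(1 - \beta) + [\rho(\beta\pi; \alpha)]^{-1}}, \tag{83}$$

giving 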
$$Q(f; \alpha) \ge \{[\rho(\beta\pi; \alpha)]^{-1} + 1 - \beta\}^{-1} = m(\Omega; \alpha), \qquad \pi < \Omega < 2\pi, \tag{84}$$

with equality holding (for $\alpha > 0$) if, and only if, 

$$f(t) = (1 - \lambda)f(t; \beta\pi, \alpha) + \lambda\left\{\varphi(t)\cos\pi t - \frac{1}{\pi}\varphi'(t)\sin\pi t\right\}, \tag{85}$$

where $f(t; \Omega, \alpha)$ is defined in (57), $\lambda$ is given by (83), and 

$$\varphi(t) = \frac{\sin(\Omega - \pi)t}{(\Omega - \pi)t}. \tag{86}$$

The uniqueness qualification, “for $\alpha > 0$”, owes to the fact [cf. (74)] 
that we need not have $q(t) = -p'(t)/\pi$, in case $\alpha = 0$, in order to 
minimize $Q(f; \alpha)$ with $f(0) = 1$. Obviously, any function in $B_2(\Omega)$ 
satisfying $f(0) = 1$ and having $(\sin \pi t)/\pi t$ as a factor will minimize 
$Q(f; 0)$. Since equality may attain in (84) for $f$ given by (85), we have 

$$\mu(\Omega; \alpha) = m(\Omega; \alpha) - 1, \qquad \pi < \Omega < 2\pi, \tag{87}$$

and the theorem is proved. □ 
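
The final minimization over $\lambda$ is easy to confirm numerically. The sketch below minimizes $(1 - \lambda)^2\rho(\beta\pi; \alpha) + \lambda^2/(1 - \beta)$ on a grid and compares the result with the closed forms for $\lambda$ and $m(\Omega; \alpha)$; the grid search and the function names are illustrative choices, not part of the proof.

```python
import math

def rho(omega, alpha):
    """rho(omega; alpha) = (pi/omega) * (alpha*omega) / arctan(alpha*omega)."""
    x = alpha * omega
    return (math.pi / omega) * (x / math.atan(x) if x > 0.0 else 1.0)

def bound(lam, omega, alpha):
    """Quantity minimized over lambda: (1 - lam)^2 * rho(beta*pi) + lam^2 / (1 - beta)."""
    beta = 2.0 - omega / math.pi
    return (1.0 - lam) ** 2 * rho(beta * math.pi, alpha) + lam ** 2 / (1.0 - beta)

def closed_form(omega, alpha):
    """Return (lambda, m) from the closed-form expressions, for pi < omega < 2*pi."""
    beta = 2.0 - omega / math.pi
    inv_rho = 1.0 / rho(beta * math.pi, alpha)
    lam = (1.0 - beta) / ((1.0 - beta) + inv_rho)
    m = 1.0 / (inv_rho + 1.0 - beta)
    return lam, m

def grid_minimum(omega, alpha, steps=100_000):
    """Brute-force minimum of bound() over lambda in [0, 1]."""
    best = min(range(steps + 1), key=lambda i: bound(i / steps, omega, alpha))
    lam = best / steps
    return lam, bound(lam, omega, alpha)

omega, alpha = 1.5 * math.pi, 1.0
print(grid_minimum(omega, alpha))
print(closed_form(omega, alpha))
lam, m = closed_form(omega, alpha)
print((m - 1.0) / m)   # mu / (1 + mu), the minimum mean-square error
```

The last printed value, $\mu/(1 + \mu)$ with $\mu = m - 1$, reproduces the minimum mean-square error for the same $\Omega$ and $\alpha$.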

A large-scale, single-period, mathematical programming model of a multi- 
commodity distribution system has been designed and implemented to analyze 
and to help reconfigure AT&T distribution facilities. Within this model, the 
number, size, and location of major stocking locations and subsidiary stocking 
locations are determined on the basis of various incurred costs. These costs 
include facility setup costs, facility closing costs, and shipping, inventorying, 
handling, and operating costs. The model incorporates various features that 
do not appear in standard facility location models, such as nonlinear economies 
of scale in operating cost, capacity constraints, special products that are 
handled by only a limited number of facilities, and establishment of subsidiary 
stocking locations where desirable. In this paper we describe the model, provide 
a mathematical programming formulation of the problem, and describe the 
algorithm that was developed to obtain good solutions in an efficient manner. 
The flexibility of the formulation and the efficiency of the solution technique 
make this model a unique and useful tool. It can provide insight when used to 
study an existing or proposed distribution system, and it has already been 
used in a variety of case studies. 


I. INTRODUCTION 


Large manufacturing and industrial concerns, such as AT&T, pro- 
vide for the warehousing and distribution of finished goods. Decisions 
concerning the number, size, and locations of warehousing and distri- 
bution facilities greatly affect the cost of a large material logistics 
system. 


*AT&T Bell Laboratories. 
This paper describes a large-scale, single-period, mathematical pro- 
gramming model of a multicommodity distribution system. Various 
quantitative studies of distribution problems can be performed using 
this model. For example, an analyst can examine the trade-off between 
many small distribution facilities and a few large facilities, determine 
a consolidation strategy in areas of contracting demand, plan an 
expansion strategy in areas of increasing demand, and so forth. 

This model currently is being used to analyze and help reconfigure 
AT&T distribution facilities in order to meet future material logistics 
requirements. The flexibility of the model has allowed it to be applied 
in a variety of in-depth case studies. These studies have included both 
national and regional studies, studies of different tiers within the 
distribution system, and studies involving different families of prod- 
ucts. To provide this flexibility, components of the model are described 
in generic terms with a minimum of restrictive assumptions. 

Within the model, the number, size, and location of major stocking 
facilities, called Distribution Centers (DCs), and subsidiary stocking 
facilities, called Local Distribution Centers (LDCs), are determined 
on the basis of various incurred costs, including facility setup costs, 
facility closing costs, and shipping, inventorying, handling, and oper- 
ating costs. The model is quite complex and combines several features, 
or combinations of features, that do not appear in standard facility 
location models. Among these features are nonlinear economies of 
scale in operating costs, capacity constraints, special products that are 
handled by only a limited number of facilities, and establishment of 
subsidiary stocking locations where desirable. These features are dis- 
cussed in Section II. 

The generic distribution system that is considered is illustrated in 
Fig. 1. We describe this system briefly here; it is described in greater 
detail in Section II. Within the model, products are assumed to move 
from various vendors (and repair shops) to the DCs. These DCs, in 
turn, distribute these products to demand area locations. In certain 
instances, a group of demand areas can also be served by an LDC. 
Different products move according to different patterns among ven- 
dors, DCs, LDCs and demand areas; these different product “types” 
are described in particular in Section 2.2. The mathematical program- 
ming formulation of this problem is quite large; for example, a problem 
with 50 possible major distribution center locations, 20 subsidiary 
stocking locations, 100 demand areas, and 1 special product would 
involve 13,170 variables and 110,531 constraints. A sample problem of 
this size was solved in 115.6 CPU seconds on an Amdahl 470/V8 
computer. 

An extensive literature exists on operations research techniques for
discrete facility location problems. A recent survey of facility location
work is given in Francis, McGinnis, and White (Ref. 1). We mention here
only briefly some representative problems and papers.

Fig. 1. The advanced distribution and location model.

In the p-median problem, p facilities are chosen so as to minimize 
the sum of the distances from facilities to customers. A Lagrangian 
relaxation technique for a capacitated p-median problem is incorpo- 
rated into the algorithm described in this paper (see Section 3.4). 
Surveys of work on p-median and related network location subjects 
can be found in Refs. 2 and 3. 

A classic model is the Uncapacitated Facility Location Problem
(UFLP), in which there exists a trade-off between setup costs of
facilities and the costs of shipping products to demand areas (Refs. 4
to 7). Complete surveys of work on UFLP can be found in Refs. 8 and 9.

Some facility location problems can be modeled as generalized
assignment problems (see Ref. 10). Here, customers with demands
must be allocated among given facilities with limited capacity. Each
customer must be served by only one facility. Lagrangian relaxation
techniques for this problem are discussed in Fisher (Ref. 11) and Ross
and Soland (Ref. 10). A related procedure is incorporated into the
algorithm described here in Section 3.3.

The Capacitated Facility Location Problem (CFLP), in which a
customer's demand can be split among several facilities, is considered,
for example, in Refs. 12, 13, and 14. Some authors have considered
the case in which there are concave operating costs associated with
each facility. See, for example, Refs. 15, 16, and 17. In particular, the
algorithm of Kelly and Khumawala (Ref. 15), which involves solving a
sequence of linear transportation problems, is adapted here for use in
the major optimization routine (see Section 3.3).

A Benders decomposition approach for a multicommodity problem
was developed by Geoffrion and Graves (Ref. 18). Discussion of dynamic
facility location can be found in Erlenkotter (Ref. 19) and in a section
of the capacity expansion survey by Luss (Ref. 20).

In Section II, we provide a complete description of the material 
logistics system being modeled and the mathematical programming 
formulation of the problem. Section III describes the solution tech- 
nique that was implemented. The first subsection provides a general 
overview of the various stages of the algorithm. The subsequent 
subsections describe each stage in turn. Some brief discussion of 
implementation details and some concluding remarks are then given 
in Section IV. 


II. DESCRIPTION OF THE PROBLEM


2.1 Types of locations and facilities 


Five major types of facilities or physical locations can be identified 
in the material logistics system. Several products $k = 1, \ldots, K$ move
among these facilities and locations. 

Vendors are those locations, such as factories and manufacturing 
locations, from which products first enter the material logistics system. 
Each product $k$ has its own set of fixed and known vendor locations
$n = 1, \ldots, N_k$.

Repair shops are similar to vendors. Each product has its own set of 
fixed and known repair shop locations $m = 1, \ldots, M_k$. For each
product, some fraction of demand $\rho_k$ (possibly zero) is to be satisfied
by repaired items. 

Demand areas are geographical areas to which products are ultimately
destined. These demand areas $j = 1, \ldots, J$, and the amount
of demand $D_{jk}$ for each product $k$ at each demand area $j$, are assumed
to be fixed and known.

Distribution centers are major intermediate stocking locations. The
DCs must be chosen from among a set of locations $i = 1, \ldots, I$, which
can be either “potential” or “existing.” For potential locations, the
model specifies a minimum capacity size $\underline{B}_i^1$, a feasible
capacity increment $b_i^1$, and a maximum capacity size $\overline{B}_i^1$.
For existing facilities with fixed capacity, we assume
$\underline{B}_i^1 = \overline{B}_i^1$ and $b_i^1 = 0$. DC $i$ can serve demand
areas within a radius of $\delta_i^1$ miles.

Local distribution centers are subsidiary stocking locations that
handle certain types of products for several nearby demand areas.
Depending upon the product type, LDCs receive products from either
a DC or directly from vendors and repair shops; they then ship the
products to demand areas. LDC locations must be chosen from among
a set of existing and potential locations $\ell = 1, \ldots, L$. Potential
facilities have minimum capacity $\underline{B}_\ell^2$, maximum capacity
$\overline{B}_\ell^2$, and capacity increment $b_\ell^2$. Existing facilities
have fixed capacity $\underline{B}_\ell^2 = \overline{B}_\ell^2$ and
$b_\ell^2 = 0$. An LDC $\ell$ can serve demand areas within a radius of
$\delta_\ell^2$ miles.

Ordinarily, each DC deals only with certain demand areas that are 
“assigned” to it. The DC is (perhaps) associated with some LDCs. 
Each demand area that is assigned to a given DC obtains products 
only from that DC, and from at most one of the LDCs associated with 
it. Some products are exceptions to this rule; they are discussed in 
Section 2.2. 


2.2 Types of products 


Many different products move through this material logistics sys- 
tem. For modeling purposes, each “product” may represent an aggre- 
gation of several products. We distinguish four categories or types of 
products, according to how they are handled within the system. These 
types are described below and are pictured in Fig. 1. In Fig. 1, the 
three demand areas in the lower right corner are assigned to an LDC 
as well as a DC, whereas the other demand areas are assigned only to 
a DC. 

Type 1 products can be handled only by a DC. These products are 
shipped to the DC from the vendor and repair shop. (We assume that 
the particular vendor and repair shop that supply a given DC are 
chosen so as to minimize shipping cost.) From the DC, they are shipped 
to all demand areas that are assigned to that DC. 

Type 2 products can be delivered to demand areas either from a DC 
or else from an LDC. If a demand area is assigned to an LDC, the DC 
ships Type 2 products to the LDC for delivery to the demand area. 
This is indicated by the dashed lines in Fig. 1 that are labeled with 
the numeral 2. 

Type 3 products can also be delivered to demand areas either from 
a DC or else from an LDC. The LDC receives shipment of Type 3 
products directly from the vendor and repair shop. We assume that 
the LDC receives shipments from the same repair shops and vendors 
that serve the DC with which the LDC is associated. This is indicated 
by the dashed lines in Fig. 1 that are labeled with the numeral 3. 

All Type 1, Type 2 and Type 3 products are handled with the 
arrangement whereby each demand area deals with only one DC and 
at most one LDC. (If a demand area is assigned to an LDC, it obtains 
all Type 2 and Type 3 products via that LDC.) However, Type 4
products (“special” products) are an exception to the rule. Only a
small number $p_k$ of DCs are chosen to handle a given Type 4 product
$k$. These DCs then serve all demand areas. This is indicated by the
alternate short-and-long dashed lines in Fig. 1. 

Each product's type is given as input by the user. We let $T_1$, $T_2$,
$T_3$, and $T_4$ denote the sets of indices of Type 1, Type 2, Type 3, and
Type 4 products, respectively.

Each unit of demand that is handled by a DC or an LDC occupies 
some average amount of warehouse space (measured in units of capac- 
ity). The amount of space occupied depends upon, among other things, 
the size of the product, turnover time of inventory, and the type of 
facility (DC or LDC). We define the following parameters: 


$s_k^1$ = warehouse space required per unit demand at a DC if a DC ships
product $k$ to a demand area,

$s_k^2$ = warehouse space required per unit demand at an LDC,

$s_k^3$ = warehouse space required per unit demand at a DC if a DC ships
product $k$ to an LDC (relevant for Type 2 products).

2.3 Costs 


2.3.1 Facility setup costs 


Facility setup costs are costs incurred when a facility (DC or LDC) 
is chosen to be open. The setup cost depends upon the size of the 
facility. We assume that, for DC $i$, opened with a capacity of $x$ units,
the setup cost is of the form $\alpha_i^1 + \beta_i^1 x$, where $\alpha_i^1$ and
$\beta_i^1$ are given constants that depend upon the particular facility.
For LDC $\ell$, the setup cost is of the form $\alpha_\ell^2 + \beta_\ell^2 x$,
where $\alpha_\ell^2$ and $\beta_\ell^2$ are given constants.
Actual total setup costs realistically might be assumed to be amortized 
over several time periods. Since ours is a static, single-period model, 
the setup cost used in the model could be set equal to an amortized 
share of the total setup cost. 


2.3.2 Facility closing costs 

In the model, if a facility is not chosen to be open, a closing cost $c_i^1$
(for DC $i$) or $c_\ell^2$ (for LDC $\ell$) is incurred.

2.3.3 Shipping costs 


Shipping cost parameters are given in cost per mile per unit demand. 
These cost parameters for product k follow: 


$t_k^1$ = cost of shipping from vendor/repair shop to DC/LDC,

$t_k^2$ = cost of shipping from DC to demand area,

$t_k^3$ = cost of shipping from DC to LDC,

$t_k^4$ = cost of shipping from LDC to demand area.


1710 TECHNICAL JOURNAL, SEPTEMBER 1985 


Obviously, for each type of product only certain of these parameters 
are applicable. (The model can also be easily modified so that shipping 
costs are measured per unit demand, independent of distance.) 


2.3.4 Inventorying or storage cost 


An inventorying or storage cost per unit demand is incurred at each 
DC and LDC. We define the following parameters: 


$\eta_{ki}^1$ = inventorying cost per unit demand at a DC $i$ that ships
product $k$ to a demand area,

$\eta_{k\ell}^2$ = inventorying cost per unit demand at an LDC $\ell$ that ships
product $k$ to a demand area,

$\eta_{ki}^3$ = inventorying cost per unit demand at a DC $i$ that ships
product $k$ to an LDC (relevant for Type 2 products).


The value of $\eta_{ki}^3$ can differ from $\eta_{ki}^1$ because of differences
in turnover time for inventory bound for an LDC and inventory bound
directly for demand areas.


2.3.5 Handling cost 


When a product is processed at a warehousing facility (either a DC 
or an LDC), there are some labor costs incurred. We define the 
following parameters: 


$h_{ki}^1$ = handling cost per unit demand at a DC $i$ that ships product $k$
to a demand area,

$h_{k\ell}^2$ = handling cost per unit demand at an LDC $\ell$ that ships
product $k$ to a demand area,

$h_{ki}^3$ = handling cost per unit demand at a DC $i$ that ships product $k$
to an LDC $\ell$ (relevant for Type 2 products).


2.3.6 Operating cost 


Other operating costs at each facility are represented as a continu- 
ous, nondecreasing, concave function of the space occupied in order to 
account for possible savings due to economies of scale. Let 


$\sigma_i^1$ = volume of space occupied at DC $i$, $i = 1, \ldots, I$, and

$\sigma_\ell^2$ = volume of space occupied at LDC $\ell$, $\ell = 1, \ldots, L$.

Then, we define the cost functions:

$f_i^1(\sigma_i^1)$ = operating cost at DC $i$, $i = 1, \ldots, I$, and

$f_\ell^2(\sigma_\ell^2)$ = operating cost at LDC $\ell$, $\ell = 1, \ldots, L$.


Within the software implementation, we assume these cost functions
to be piecewise linear with a nonnegative intercept at the origin. In
typical examples, each function used between three and five piecewise
linear segments.
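
A continuous, nondecreasing, concave piecewise-linear function can be evaluated as the minimum of its affine pieces. The sketch below illustrates that idea only; the function name and the three example segments are hypothetical and are not taken from the software described here.

```python
def concave_piecewise_cost(sigma, segments):
    """Evaluate a concave piecewise-linear cost at occupied space `sigma`.

    `segments` is a list of (intercept, slope) pairs with nonnegative
    entries. Taking the minimum over the affine pieces yields a
    continuous, nondecreasing, concave function.
    """
    if sigma < 0:
        raise ValueError("occupied space must be nonnegative")
    return min(a + b * sigma for a, b in segments)


# Hypothetical three-segment cost: the marginal cost falls as volume grows.
segments = [(0.0, 5.0), (200.0, 3.0), (600.0, 2.0)]
costs = [concave_piecewise_cost(s, segments) for s in (50, 400, 1000)]
```

With these illustrative segments the marginal cost drops from 5 to 3 to 2 per unit of space, which is the economy-of-scale effect the operating cost is meant to capture.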
2.4 The mathematical programming model


2.4.1 Decision variables


To formulate the mathematical programming model, we first specify 
the necessary decision variables. All variables take on integer values. 
They are as follows: 


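$$y_i = \begin{cases} 1 & \text{if DC } i \text{ is open,} \\ 0 & \text{otherwise,} \end{cases} \quad \text{for } i = 1, \ldots, I,$$

$$z_{i\ell} = \begin{cases} 1 & \text{if LDC } \ell \text{ is open and served by DC } i, \\ 0 & \text{otherwise,} \end{cases} \quad \text{for } \ell = 1, \ldots, L \text{ and } i = 1, \ldots, I,$$

$$x_{ij} = \begin{cases} 1 & \text{if demand area } j \text{ is assigned to DC } i, \\ 0 & \text{otherwise,} \end{cases} \quad \text{for } i = 1, \ldots, I \text{ and } j = 1, \ldots, J,$$

$$w_{\ell j} = \begin{cases} 1 & \text{if demand area } j \text{ is assigned to LDC } \ell, \\ 0 & \text{otherwise,} \end{cases} \quad \text{for } \ell = 1, \ldots, L \text{ and } j = 1, \ldots, J,$$

$$v_{ik} = \begin{cases} 1 & \text{if DC } i \text{ is used to serve special product } k, \\ 0 & \text{otherwise,} \end{cases} \quad \text{for } i = 1, \ldots, I \text{ and } k \in T_4,$$

$$u_{ijk} = \begin{cases} 1 & \text{if demand area } j \text{ receives special product } k \text{ from DC } i, \\ 0 & \text{otherwise,} \end{cases} \quad \text{for } j = 1, \ldots, J, \; i = 1, \ldots, I, \text{ and } k \in T_4,$$

$q_i^1$ = number of size increments above the minimum opened at DC $i$,
for $i = 1, \ldots, I$,

$q_\ell^2$ = number of size increments above the minimum opened at LDC $\ell$,
for $\ell = 1, \ldots, L$.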


2.4.2 Distance and other parameters 


Shipping cost calculations require the distances between pairs of 
locations. For notational simplicity, we refer to all such pairwise 
distances by the notation $d$ with two subscripts. The subscript $i$ refers
to a DC $i$, subscript $\ell$ to an LDC $\ell$, and subscript $j$ to a demand
area $j$. Further, the subscript $n(i)$ refers to the vendor closest to DC $i$,
and subscript $m(i)$ refers to the repair shop closest to DC $i$. Thus,
$d_{n(i),i}$ is the distance to DC $i$ from its nearest vendor, $d_{\ell j}$ is
the distance from LDC $\ell$ to demand area $j$, and so forth.

A particular latitude/longitude point is associated with each facility
location and each demand area and used to estimate road distances 
on an as-needed basis. The software allows users to provide distance 
data that would override the calculated distance. 
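
The text does not state how road distances are estimated from the latitude/longitude points. One common approach, shown here purely as an assumption, is to scale the great-circle (haversine) distance by a circuity factor and to let user-supplied values take precedence. The function names, the factor 1.2, and the `overrides` dictionary are all hypothetical.

```python
import math

EARTH_RADIUS_MILES = 3958.8  # mean Earth radius in statute miles


def great_circle_miles(lat1, lon1, lat2, lon2):
    """Haversine distance in miles between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_MILES * math.asin(math.sqrt(a))


def road_distance(a, b, coords, overrides=None, circuity=1.2):
    """Estimate the road distance between named locations `a` and `b`.

    `coords` maps a location name to (latitude, longitude). A value in
    `overrides[(a, b)]` replaces the calculated distance. The circuity
    factor is an assumed multiplier, not a value from the paper.
    """
    overrides = overrides or {}
    if (a, b) in overrides:
        return overrides[(a, b)]
    return circuity * great_circle_miles(*coords[a], *coords[b])


coords = {"dc": (0.0, 0.0), "area": (0.0, 1.0)}
estimate = road_distance("dc", "area", coords)
```

Computing distances on demand in this way avoids storing a full matrix of pairwise distances, which is consistent with the as-needed calculation described above.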

To simplify the formulation, let the space (units of capacity) required
for a DC to serve demand area $j$ (excluding Type 4 products
and provided it is not served by an LDC) be

$$S_j^1 = \sum_{k \in T_1 \cup T_2 \cup T_3} s_k^1 D_{jk}. \tag{1}$$

If demand area $j$ is served by an LDC as well as a DC, then the amount
of space required at the LDC is

$$S_j^2 = \sum_{k \in T_2 \cup T_3} s_k^2 D_{jk}, \tag{2}$$

and the amount of space required at the DC is

$$S_j^3 = \sum_{k \in T_1} s_k^1 D_{jk} + \sum_{k \in T_2} s_k^3 D_{jk}. \tag{3}$$

The space required at a DC to serve Type 4 product $k \in T_4$ for demand
area $j$ is

$$S_{jk}^4 = s_k^1 D_{jk}. \tag{4}$$
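
As an illustration of eqs. (1) through (4), the following sketch aggregates per-product demand into the four space requirements for one demand area. The dictionary layout, the function name, and the sample numbers are assumptions made for the example.

```python
def space_requirements(demand, types, s1, s2, s3):
    """Return (S1, S2, S3, S4) for a single demand area.

    demand[k] is D_jk for product k, types[k] is the product type (1..4),
    and s1, s2, s3 hold the per-unit space parameters by product.
    S4 is a dict keyed by Type 4 product, since eq. (4) is per product.
    """
    S1 = sum(s1[k] * d for k, d in demand.items() if types[k] in (1, 2, 3))
    S2 = sum(s2[k] * d for k, d in demand.items() if types[k] in (2, 3))
    S3 = (sum(s1[k] * d for k, d in demand.items() if types[k] == 1)
          + sum(s3[k] * d for k, d in demand.items() if types[k] == 2))
    S4 = {k: s1[k] * d for k, d in demand.items() if types[k] == 4}
    return S1, S2, S3, S4


# Hypothetical data: one product of each type.
types = {"A": 1, "B": 2, "C": 3, "D": 4}
demand = {"A": 10, "B": 20, "C": 30, "D": 5}
s1 = {"A": 1.0, "B": 2.0, "C": 1.5, "D": 4.0}
s2 = {"B": 1.0, "C": 0.5}
s3 = {"B": 0.5}
S1, S2, S3, S4 = space_requirements(demand, types, s1, s2, s3)
```

In this example the DC needs 95 units of space if it serves the area alone, but only 20 units if an LDC is involved, because Type 3 products then bypass the DC and Type 2 products use the smaller pass-through allowance.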
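Likewise, it is convenient to aggregate the shipping costs, inventorying
costs, and handling costs associated with assigning demand area $j$
to DC $i$ in an “assignment cost” $A_{ij}^1$, as follows:

$$A_{ij}^1 = \sum_{k \in T_1 \cup T_2 \cup T_3} \bigl( t_k^1 d_{n(i),i}(1 - \rho_k) + t_k^1 d_{m(i),i}\rho_k + t_k^2 d_{ij} + h_{ki}^1 + \eta_{ki}^1 \bigr) D_{jk}. \tag{5}$$

If an LDC $\ell$ is involved, the assignment cost $A_{ij\ell}^2$ is expressed as

$$\begin{aligned}
A_{ij\ell}^2 = {} & \sum_{k \in T_1} \bigl( t_k^1 d_{n(i),i}(1 - \rho_k) + t_k^1 d_{m(i),i}\rho_k + t_k^2 d_{ij} + h_{ki}^1 + \eta_{ki}^1 \bigr) D_{jk} \\
& + \sum_{k \in T_2} \bigl( t_k^1 d_{n(i),i}(1 - \rho_k) + t_k^1 d_{m(i),i}\rho_k + t_k^3 d_{i\ell} + t_k^4 d_{\ell j} + h_{ki}^3 + h_{k\ell}^2 + \eta_{ki}^3 + \eta_{k\ell}^2 \bigr) D_{jk} \\
& + \sum_{k \in T_3} \bigl( t_k^1 d_{n(i),\ell}(1 - \rho_k) + t_k^1 d_{m(i),\ell}\rho_k + t_k^4 d_{\ell j} + h_{k\ell}^2 + \eta_{k\ell}^2 \bigr) D_{jk}.
\end{aligned} \tag{6}$$

Finally, the cost $A_{ijk}^4$ of assigning Type 4 product $k$ at demand area $j$
to DC $i$ is as follows:

$$A_{ijk}^4 = \bigl( t_k^1 d_{n(i),i}(1 - \rho_k) + t_k^1 d_{m(i),i}\rho_k + t_k^2 d_{ij} + h_{ki}^1 + \eta_{ki}^1 \bigr) D_{jk}. \tag{7}$$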

In the event that demand area $j$ cannot be assigned to a DC $i$ or an
LDC $\ell$ because $j$ lies outside the operating radius of the facility (i.e.,
$d_{ij} > \delta_i^1$ or $d_{\ell j} > \delta_\ell^2$), then the corresponding
assignment costs can be set to an arbitrarily large number.
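
The DC-only assignment cost of eq. (5), together with the large-number convention for areas outside the operating radius, might be computed as in the sketch below. The record layout, the constant `BIG_M`, and the sample figures are hypothetical.

```python
BIG_M = 1e12  # stands in for the "arbitrarily large number"


def dc_assignment_cost(products, d_vendor, d_repair, d_ij, radius):
    """Assignment cost of eq. (5) for one DC and one demand area.

    `products` lists the Type 1-3 products as dicts with keys
    t1, t2, rho, h1, eta1 (parameters at this DC) and D (demand D_jk).
    `d_vendor` and `d_repair` are the distances to the DC from its
    nearest vendor and repair shop; `d_ij` is the DC-to-area distance.
    """
    if d_ij > radius:
        return BIG_M
    total = 0.0
    for p in products:
        inbound = p["t1"] * (d_vendor * (1 - p["rho"]) + d_repair * p["rho"])
        outbound = p["t2"] * d_ij
        total += (inbound + outbound + p["h1"] + p["eta1"]) * p["D"]
    return total


products = [{"t1": 0.01, "t2": 0.02, "rho": 0.2,
             "h1": 1.0, "eta1": 0.5, "D": 100}]
cost = dc_assignment_cost(products, d_vendor=300, d_repair=100,
                          d_ij=50, radius=250)
too_far = dc_assignment_cost(products, 300, 100, d_ij=400, radius=250)
```

Here the per-unit cost is 2.4 + 0.2 inbound, 1.0 outbound, and 1.5 for handling and inventory, so 100 units of demand cost about 510.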


2.4.3 Formulating the model 


Below, we provide the mathematical formulation for the problem. 
Then, the objective function and each of the constraints is explained 
in turn. The problem is formulated as follows: 


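$$\begin{aligned}
\min \; & \sum_{i=1}^{I} \bigl( \alpha_i^1 + \beta_i^1 (\underline{B}_i^1 + b_i^1 q_i^1) \bigr) y_i
 + \sum_{\ell=1}^{L} \bigl( \alpha_\ell^2 + \beta_\ell^2 (\underline{B}_\ell^2 + b_\ell^2 q_\ell^2) \bigr) \sum_{i=1}^{I} z_{i\ell} \\
& + \sum_{i=1}^{I} c_i^1 (1 - y_i) + \sum_{\ell=1}^{L} c_\ell^2 \Bigl( 1 - \sum_{i=1}^{I} z_{i\ell} \Bigr) \\
& + \sum_{i=1}^{I} \sum_{j=1}^{J} A_{ij}^1 x_{ij} \Bigl( 1 - \sum_{\ell=1}^{L} w_{\ell j} \Bigr)
 + \sum_{i=1}^{I} \sum_{j=1}^{J} \sum_{\ell=1}^{L} A_{ij\ell}^2 x_{ij} w_{\ell j} \\
& + \sum_{i=1}^{I} \sum_{j=1}^{J} \sum_{k \in T_4} A_{ijk}^4 u_{ijk}
 + \sum_{i=1}^{I} f_i^1(\sigma_i^1) y_i
 + \sum_{\ell=1}^{L} f_\ell^2(\sigma_\ell^2) \sum_{i=1}^{I} z_{i\ell}
\end{aligned} \tag{8a}$$

subject to

$$\sum_{i=1}^{I} x_{ij} = 1 \quad \text{for } j = 1, \ldots, J, \tag{8b}$$

$$x_{ij} \le y_i \quad \text{for } i = 1, \ldots, I \text{ and } j = 1, \ldots, J, \tag{8c}$$

$$\sum_{\ell=1}^{L} w_{\ell j} \le 1 \quad \text{for } j = 1, \ldots, J, \tag{8d}$$

$$w_{\ell j} \le x_{ij} z_{i\ell} \quad \text{for } i = 1, \ldots, I, \; \ell = 1, \ldots, L, \text{ and } j = 1, \ldots, J, \tag{8e}$$

$$\sum_{i=1}^{I} z_{i\ell} \le 1 \quad \text{for } \ell = 1, \ldots, L, \tag{8f}$$

$$\sigma_i^1 = \sum_{j=1}^{J} S_j^1 x_{ij} \Bigl( 1 - \sum_{\ell=1}^{L} w_{\ell j} \Bigr) + \sum_{\ell=1}^{L} \sum_{j=1}^{J} S_j^3 x_{ij} w_{\ell j} + \sum_{k \in T_4} \sum_{j=1}^{J} S_{jk}^4 u_{ijk} \quad \text{for } i = 1, \ldots, I, \tag{8g}$$

$$\sigma_i^1 \le \underline{B}_i^1 + b_i^1 q_i^1 \quad \text{for } i = 1, \ldots, I, \tag{8h}$$

$$\sigma_\ell^2 = \sum_{j=1}^{J} S_j^2 w_{\ell j} \quad \text{for } \ell = 1, \ldots, L, \tag{8i}$$

$$\sigma_\ell^2 \le \underline{B}_\ell^2 + b_\ell^2 q_\ell^2 \quad \text{for } \ell = 1, \ldots, L, \tag{8j}$$

$$\underline{B}_i^1 + b_i^1 q_i^1 \le \overline{B}_i^1 \quad \text{for } i = 1, \ldots, I, \tag{8k}$$

$$\underline{B}_\ell^2 + b_\ell^2 q_\ell^2 \le \overline{B}_\ell^2 \quad \text{for } \ell = 1, \ldots, L, \tag{8l}$$

$$\sum_{i=1}^{I} v_{ik} \le p_k \quad \text{for } k \in T_4, \tag{8m}$$

$$u_{ijk} \le v_{ik} \quad \text{for } j = 1, \ldots, J, \; i = 1, \ldots, I, \text{ and } k \in T_4, \tag{8n}$$

$$\sum_{i=1}^{I} u_{ijk} = 1 \quad \text{for } j = 1, \ldots, J \text{ and } k \in T_4, \tag{8o}$$

$$y_i, z_{i\ell}, x_{ij}, w_{\ell j}, v_{ik}, u_{ijk} \in \{0, 1\} \quad \text{for } i = 1, \ldots, I, \; j = 1, \ldots, J, \; \ell = 1, \ldots, L, \text{ and } k \in T_4, \tag{8p}$$

$$q_i^1, q_\ell^2 \in \{0, 1, \ldots, \infty\} \quad \text{for } i = 1, \ldots, I \text{ and } \ell = 1, \ldots, L. \tag{8q}$$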


The first two summation terms in the objective function (8a) rep- 
resent the setup cost incurred for open DCs and LDCs. The next two 
terms represent the closing costs that are incurred if a DC or an LDC 
is not open. In the fifth term, we include the assignment costs that 
are incurred if demand area j is assigned to DC i with no LDC involved. 
(Only in that case would $x_{ij} (1 - \sum_{\ell=1}^{L} w_{\ell j}) = 1$.) The sixth term, on
the other hand, gives the assignment costs that are incurred if demand
area $j$ is assigned to DC $i$ and LDC $\ell$, and the seventh term considers
the assignment costs for Type 4 products. The last two terms of the 
objective function represent the operating cost that is incurred at each 
open facility. 

Constraints (8b) ensure that each demand area $j$ is assigned to
exactly one DC, and constraints (8c) ensure that demand areas are
assigned only to DCs that are open. Constraints (8d) permit each
demand area $j$ to be assigned to at most one LDC. The condition (8e)
guarantees that such an LDC assignment is made only if the LDC is
open and is served by the same DC that serves the demand area $j$.
Constraints (8f) ensure that each LDC is served by at most one DC.
Constraints (8g) define $\sigma_i^1$, the space actually utilized at each DC $i$.
This is obtained by adding the space required to serve demand areas
that are assigned only to DC $i$, plus space required for demand areas
assigned to both DC $i$ and an LDC, plus space required to serve any
Type 4 products that are assigned to DC $i$. In (8h), this space utilized
is constrained to not exceed the capacity installed. Similarly, (8i)
defines $\sigma_\ell^2$, the space utilized at LDC $\ell$, and (8j) constrains
$\sigma_\ell^2$ to not exceed the capacity installed at that LDC. Constraints
(8k) and (8l) ensure that the capacity installed does not exceed the
maximum permitted capacity for DCs and LDCs, respectively. In (8m), the
number of DCs that serve each Type 4 product $k$ does not exceed the
permitted number $p_k$. Constraints (8n) guarantee that a demand area
receives each Type 4 product $k$ from a DC that handles that product.
Constraints (8o) require that each demand area $j$ be assigned to only
one DC for a given Type 4 product. [In the event that no Type 4
products occur in the problem, constraints (8m) through (8o) do not
appear.] Finally, conditions (8p) and (8q) enforce integer constraints
on the variables.

The integer program (8) is quite large. For example, for 10 possible 
DC locations, 10 possible LDC locations, 50 demand areas, and 1 Type 
4 product, there are 1640 variables and 6221 constraints. For a larger 
problem of 50 DCs, 20 LDCs, 100 demand areas, and 1 Type 4 product, 
there are 13,170 variables and 110,531 constraints. Nonlinearities
appear in the objective function (8a) and in constraints (8e) and (8g),
thus making it even more difficult to solve the program directly.
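
The quoted problem sizes can be reproduced by counting the decision variables of Section 2.4.1 and one constraint per index combination in (8b) through (8o). The sketch below assumes that the quantities defined by (8g) and (8i) are not counted as variables and that the domain conditions (8p) and (8q) are not counted as constraints; under that reading the counts match both examples in the text. The function name is hypothetical.

```python
def model_size(I, L, J, n_special):
    """Count variables and constraints of program (8).

    I, L, J are the numbers of DC sites, LDC sites, and demand areas;
    n_special is the number of Type 4 products.
    """
    variables = (I                    # y
                 + I * L              # z
                 + I * J              # x
                 + L * J              # w
                 + I * n_special      # v
                 + I * J * n_special  # u
                 + I + L)             # q for DCs and LDCs
    constraints = (J                  # (8b)
                   + I * J            # (8c)
                   + J                # (8d)
                   + I * L * J        # (8e)
                   + L                # (8f)
                   + 3 * I            # (8g), (8h), (8k)
                   + 3 * L            # (8i), (8j), (8l)
                   + n_special        # (8m)
                   + I * J * n_special  # (8n)
                   + J * n_special)   # (8o)
    return variables, constraints


small = model_size(10, 10, 50, 1)
large = model_size(50, 20, 100, 1)
```

The term for (8e) grows as the product of all three index ranges and accounts for 100,000 of the 110,531 constraints in the larger example.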


III. SOLUTION APPROACH


3.1 Overview


Because of the difficulty of the integer programming problem, the 
proposed algorithm contains some heuristic elements. In particular, 
we propose to first treat a simpler version of the problem and then 
adjust this solution with a series of heuristics to obtain a solution to 
the overall problem. 

The elements of the problem that are judged to be most important 
are considered in the initial optimization. Other elements that are 
considered, by comparison, less important or elements that are ex- 
pected to appear less often in actual case studies are treated by the 
secondary optimization. Specifically, the issues of Type 4 product 
distribution, LDC locations, and discrete facility sizing are set aside 
in the initial optimization. The initial optimization problem thereby 
becomes a type of capacitated facility location problem with concave 
costs. After obtaining a solution to this problem, the solution is
modified in a step-by-step fashion to incorporate, in turn, Type 4 
products, LDC locations and facility sizing. In the subsections that 
follow, we describe the various portions of the algorithm. 

Within our software implementation, there is the option to specify 
that certain variables be fixed in advance, for example, that certain 
DCs or LDCs be fixed open or closed, that certain demand area 
assignments be forced or forbidden, or that certain DCs be prevented 
from handling Type 4 products. The necessary modifications to the 
algorithm are generally straightforward, and therefore not discussed 
explicitly here. 


3.2 The preprocessor 


In Section 2.4.2 we described various DC and LDC space require- 
ment parameters and assignment cost parameters. These parameters 
are computed by the algorithm in a preprocessor routine. The impor- 
tance of the space requirement parameters lies in the fact that, when 
not considering Type 4 products, the original multicommodity problem 
becomes a single-commodity problem. The “commodity” in this case is 
warehouse space; the DCs have supplies of space and the demand 
areas require space. Further, the assignment costs (5) and (7) provide 
a convenient aggregation of shipping, inventorying, and handling 
costs. (Because of storage requirements, however, the assignment cost 
for assignments that use LDCs [eq. (6)] is calculated as needed.) 


3.3 The primary optimization 


In the primary optimization routine, we determine a set of DCs to
be opened. Initially, we assume that all demand is served by DCs alone
and that the maximum possible capacity $\overline{B}_i^1$ is available at all
open facilities. We associate a cost with each possible pairing of a DC $i$
and a demand area $j$ of the form

$$C_{ij} = A_{ij}^1 + \beta_i^1 S_j^1. \tag{9}$$

This represents the assignment cost for demand area $j$ (as discussed
in Section 2.4.2) plus a cost corresponding to the variable setup cost
for the warehouse space required to serve $j$. We also compute a “net
fixed cost”

$$F_i = \alpha_i^1 - c_i^1, \tag{10}$$


which is equal to the difference between the fixed setup cost and the 
closing cost. 

At first, we also set aside the requirement that each demand area 
be served by only one DC. After first obtaining a solution without this 
restriction, we then will adjust the solution to enforce the restriction. 
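
The initial problem is then the following capacitated facility location
problem with concave operating cost:

$$\min \sum_{i=1}^{I} F_i y_i + \sum_{i=1}^{I} f_i^1(\sigma_i^1) + \sum_{i=1}^{I} \sum_{j=1}^{J} C_{ij} x_{ij} \tag{11a}$$

subject to

$$\sum_{j=1}^{J} S_j^1 x_{ij} \le \overline{B}_i^1 y_i \quad \text{for } i = 1, \ldots, I, \tag{11b}$$

$$\sum_{i=1}^{I} x_{ij} = 1 \quad \text{for } j = 1, \ldots, J, \tag{11c}$$

$$\sigma_i^1 = \sum_{j=1}^{J} S_j^1 x_{ij} \quad \text{for } i = 1, \ldots, I, \tag{11d}$$

$$0 \le x_{ij} \le 1 \quad \text{for } i = 1, \ldots, I \text{ and } j = 1, \ldots, J, \tag{11e}$$

$$y_i \in \{0, 1\} \quad \text{for } i = 1, \ldots, I. \tag{11f}$$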


Several algorithms for problems of this form have been proposed
(Refs. 15 to 17). We have implemented the iterative algorithm due to Kelly
and Khumawala (Ref. 15) that defines and solves a sequence of standard
linear transportation problems. In these problems, the DCs are “sources”
and demand areas are “sinks.” Linear costs on arcs from sources to
sinks are based on the values of $C_{ij}$ and on the values of
$f_i^1(\sigma_i^1)$ that are implied by the solution at the previous iteration.
Key to the algorithm is the provision for a “dummy sink.” Incoming arcs
to this dummy sink have negative costs based on the setup costs $F_i$ and
the operating cost values $f_i^1(\sigma_i^1)$ at the previous iteration.
Intuitively, this dummy sink offers to “buy back” the capacity at a DC.
Costs on arcs into this sink are chosen from iteration to iteration in
such a way as to discourage opening very underutilized DCs and to
encourage taking advantage of the economies of scale in the concave
operating cost. Complete details are given in Ref. 15. Within our
implementation, the linear transportation problems are solved using
primal network flow-convex (Ref. 21), which is a state-of-the-art simplex
network flow code.

Upon solving problem (11), we obtain a set G of open DCs, each of 
which serves some group of demand areas. Some set U of demand 
areas will be served by more than one DC. Typically, in our
computational experience, the number of such demand areas that have their
demand “split” in this way among DCs is small. We next seek to
resolve these splits and obtain a solution in which each demand area 
is served by only one DC. 

Within this phase, we momentarily leave fixed those demand areas 
that are assigned to only one DC in the solution obtained to (11). The 
demand areas $j \in U$ whose assignments have been split are considered
to be unassigned. The fixed assignments, on the other hand, take up
warehouse space at the open DCs. The remaining available warehouse
space at each DC $i$ is denoted $B_i$.

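To take the unassigned demand areas and assign them among the
open DCs, we must, if possible, (approximately) solve this problem:

$$\min \sum_{i \in G} \sum_{j \in U} C_{ij} x_{ij} \tag{12a}$$

subject to

$$\sum_{i \in G} x_{ij} = 1 \quad \text{for } j \in U, \tag{12b}$$

$$\sum_{j \in U} S_j^1 x_{ij} \le B_i \quad \text{for } i \in G, \tag{12c}$$

$$x_{ij} \in \{0, 1\} \quad \text{for } j \in U \text{ and } i \in G. \tag{12d}$$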


This is of the form of a generalized assignment problem (Refs. 10 and 11).
As described below, we first attempt to solve this using Lagrangian
relaxation (see Ref. 11) and branch-and-bound techniques. Later, we
add additional DCs to the set $G$ in the event that (12) is infeasible.

If we associate nonnegative Lagrange multipliers $\lambda_i$, $i \in G$, with
constraints (12c) and incorporate these constraints into the objective
(12a), we obtain the relaxed problem:

$$\min \sum_{i \in G} \sum_{j \in U} C_{ij} x_{ij} + \sum_{i \in G} \lambda_i \Bigl( \sum_{j \in U} S_j^1 x_{ij} - B_i \Bigr) \tag{13a}$$

subject to

$$\sum_{i \in G} x_{ij} = 1 \quad \text{for } j \in U, \tag{13b}$$

$$x_{ij} \in \{0, 1\} \quad \text{for } i \in G \text{ and } j \in U. \tag{13c}$$

The relaxed problem can be solved by inspection. For each $j$, let index
$e$ be chosen so that

$$C_{ej} + \lambda_e S_j^1 = \min_{i \in G} \{ C_{ij} + \lambda_i S_j^1 \}. \tag{14}$$

(Ties can be broken arbitrarily.) Then, for each demand area $j$,

$$x_{ij} = \begin{cases} 1 & \text{if } i = e, \\ 0 & \text{otherwise,} \end{cases} \quad \text{for } i \in G. \tag{15}$$

This is incorporated into an iterative scheme where the $\lambda_i$ are
increased from iteration to iteration if the solution obtained violates
constraints (12c). This update is of the form

$$\lambda_i \leftarrow \lambda_i + \tau \Bigl( \sum_{j \in U} S_j^1 x_{ij} - B_i \Bigr), \tag{16}$$

where “$\leftarrow$” indicates “is replaced by” and $\tau$ is a positive
scalar. A formula for $\tau$ is

$$\tau = \frac{\gamma Z}{\sum_{i \in G} \Bigl[ \sum_{j \in U} S_j^1 x_{ij} - B_i \Bigr]^2}, \tag{17}$$

where $\gamma$ is a positive scalar (in our implementation, $\gamma = 0.25$ has
been used successfully) and $Z$ is an estimate of the difference between the
optimal objective function value (12a) in the original problem and the
current value (13a) in the relaxed problem. Within our implementation,
we obtain an estimate of the magnitude of (12a) based on
arbitrarily assigning each $j \in U$ to one of the DCs to which it has a
split assignment. The value of $Z$ is then chosen to be 10 percent of
this estimate of (12a).

Similar updating rules appear in Held, Wolfe, and Crowder (Ref. 22).
Similar relaxation techniques for generalized assignment problems are
discussed in Fisher (Ref. 11).
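
The inspection rule (14)-(15) and the multiplier update (16)-(17) can be combined into a short loop, sketched below. This is an illustration under stated assumptions rather than the implemented routine: the function name and data layout are hypothetical, the step estimate `Z` is passed in as a fixed number, and the multipliers are clipped at zero to keep them nonnegative, a safeguard that eq. (16) as printed does not show.

```python
def lagrangian_assign(C, S, B, G, U, Z, gamma=0.25, max_iter=40):
    """Try to find a feasible solution of (12) by Lagrangian relaxation.

    C[i][j] is the cost C_ij, S[j] the space S_j^1, B[i] the remaining
    capacity of DC i, G the list of open DCs, U the unassigned areas.
    Returns (assignment, multipliers); assignment is None on failure.
    """
    lam = {i: 0.0 for i in G}
    for _ in range(max_iter):
        # Eqs. (14)-(15): each area picks the DC of least penalized cost.
        assign = {j: min(G, key=lambda i: C[i][j] + lam[i] * S[j])
                  for j in U}
        # Capacity violation of (12c) at each DC (negative means slack).
        excess = {i: sum(S[j] for j in U if assign[j] == i) - B[i]
                  for i in G}
        if all(v <= 0 for v in excess.values()):
            return assign, lam
        # Eq. (17): step size from the squared norm of the violations.
        tau = gamma * Z / sum(v * v for v in excess.values())
        # Eq. (16), clipped at zero (assumed safeguard).
        for i in G:
            lam[i] = max(0.0, lam[i] + tau * excess[i])
    return None, lam


# Hypothetical instance: DC "a" is cheapest but holds only two areas.
G, U = ["a", "b"], [1, 2, 3]
C = {"a": {1: 1.0, 2: 1.0, 3: 1.0}, "b": {1: 1.12, 2: 1.5, 3: 2.0}}
S = {1: 1.0, 2: 1.0, 3: 1.0}
B = {"a": 2.0, "b": 2.0}
assign, lam = lagrangian_assign(C, S, B, G, U, Z=1.0)
```

In this small instance the multiplier on the overloaded DC rises by 0.05 per iteration until demand area 1, whose alternative is only slightly more expensive, moves to the other DC and the capacities are respected.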

If the Lagrangian relaxation problem (13) does not yield a feasible 
solution to (12) within a reasonable number of iterations (40 iterations 
have been used successfully in our implementation), then a branch- 
and-bound technique for (12) is initiated. A description of branch- 
and-bound algorithms for integer programming problems can be found, 
for example, in Ref. 23. We outline below the major features of the 
algorithm that we have implemented. 

At each node of the branch-and-bound tree, assignments for some
of the demand areas $j \in U$ are assumed fixed. At each node, a
Lagrangian relaxation problem, similar to (13), is solved for the
unfixed demand areas. A lower bound on the optimal objective function
value (12a), given the fixed assignments at the node, is obtained by
evaluating the relaxed objective function (13a) at solution (15).
Likewise, another lower bound at the node can also be calculated by
assuming that each unfixed demand area is assigned to the least costly 
DC. The branch-and-bound routine uses the maximum of the two 
lower bounds. If a feasible solution is found by Lagrangian relaxation, 
it is an upper bound. A very simple branching rule is used in which 
variables are chosen for branching in order of increasing j. Nodes are 
chosen for branching by the least lower bound value among most 
recently created unfathomed nodes. If the lower bound at a given node 
is within, say, 10 percent of a current upper bound, then no further 
branching is done from that node. 

If problem (12) is not feasible, an additional facility is chosen to be
in the open set and then problem (13) is re-solved with the DCs in the
larger set $G$ fixed open and all other DCs fixed closed. This additional
new facility is chosen according to a rule that estimates the change in
assignment cost that would result from its opening. That is, for each
DC $i \notin G$, compute

$$\Omega_i = \sum_{j \in U} \min_{k \in G} \{ \max(C_{kj} - C_{ij}, 0) \}. \tag{18}$$

Each term in this summation gives, for a demand area $j$, the minimum
savings in assignment cost that would result if DC $i$ were available.
The DC with maximum $\Omega_i$ value is chosen to be open and is thus
added to $G$. (This rule is similar to the “largest omega” rule proposed
by Khumawala (Ref. 6) as a branching rule in a branch-and-bound algorithm
for plant location problems.) The Kelly and Khumawala algorithm for
problem (11) is then repeated with facilities $i \in G$ open and all others
forced to be closed.

In Fig. 2, we provide a flowchart of the primary optimization routine. 
This figure summarizes the procedures described in this subsection. 
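
The facility-selection rule of eq. (18) can be sketched as follows. The function name, argument names, and cost data are hypothetical; `candidates` stands for the DCs that are not currently in the open set.

```python
def largest_omega(C, G, candidates, U):
    """Pick the closed DC with the largest Omega value, as in eq. (18).

    C[i][j] is the cost C_ij, G the open DCs, `candidates` the DCs not
    in G, and U the demand areas with unresolved assignments.
    Returns (chosen DC, its Omega value).
    """
    def omega(i):
        # For each area: the smallest saving over the currently open DCs,
        # counted only when DC i would actually be cheaper.
        return sum(min(max(C[k][j] - C[i][j], 0.0) for k in G) for j in U)

    best = max(candidates, key=omega)
    return best, omega(best)


C = {"a": {1: 10.0, 2: 8.0},
     "b": {1: 6.0, 2: 9.0},
     "c": {1: 9.0, 2: 3.0}}
chosen, value = largest_omega(C, G=["a"], candidates=["b", "c"], U=[1, 2])
```

Here DC "b" would save 4 on area 1 and nothing on area 2, while DC "c" would save 1 and 5, so "c" is opened with a value of 6.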


3.4 Assigning Type 4 products 


As explained in Section 2.2, for each Type 4 product k, only a limited 
number $p_k$ of DC facilities are chosen to handle it. These facilities can
be different for each k. We assume that Type 4 products occupy, at 
most, only a small percentage (perhaps 10 percent or less) of the 
warehouse space required. 

Given the demand area assignments determined by the primary
optimization algorithm, each open DC $i \in G$ only has some amount
$B_i$ of warehouse space still available. If demand area $j$ is assigned to
DC $i$ for Type 4 product $k$, we associate cost

$$C_{ijk}^4 = A_{ijk}^4 + \beta_i^1 S_{jk}^4, \tag{19}$$

which represents the assignment cost plus a share of the variable
setup cost.
Given the set G, the problem of choosing $p_k$ locations to serve


product k can be formulated as the following capacitated p-median 
problem: 


$$\min \sum_{i \in G} \sum_{j=1}^{J} C^k_{ij}\, u_{ijk} \tag{20a}$$

subject to

$$\sum_{i \in G} v_{ik} \le p_k, \tag{20b}$$

$$\sum_{i \in G} u_{ijk} = 1 \quad \text{for } j = 1, \cdots, J, \tag{20c}$$

$$u_{ijk} \le v_{ik} \quad \text{for } j = 1, \cdots, J \text{ and } i \in G, \tag{20d}$$

$$\sum_{j=1}^{J} S^k_j\, u_{ijk} \le B_i \quad \text{for } i \in G, \tag{20e}$$

$$u_{ijk},\, v_{ik} \in \{0, 1\} \quad \text{for } j = 1, \cdots, J \text{ and } i \in G. \tag{20f}$$


[Fig. 2—The primary optimization routine. Flowchart boxes, in order:
solve the concave-cost, capacitated facility location problem (11);
set G = set of open facilities and U = set of demand areas with split
assignments; apply the Lagrangian relaxation technique to the
generalized assignment problem (12); feasible solution found?; apply
the branch-and-bound technique for (12); feasible solution found?;
open a new facility according to the largest-Ω rule and add it to set G;
force facilities in G to be open, all others closed.]



Without constraints (20e), this is the well-known p-median
problem.²⁴,²⁵ The additional constraints (20e) are capacity constraints
that take into account the amount of warehouse space that is actually
available to serve product k. Problem (20) is solved for each Type 4
product in turn. If there is more than one Type 4 product, the available
capacities $B_i$ are updated each time (20) is solved.

To solve problem (20), we apply a Lagrangian relaxation technique.
Nonnegative Lagrange multipliers $\mu_i$ are associated with constraints
(20e) and unrestricted multipliers $\lambda_j$ with constraints (20c). These
constraints are incorporated into the objective function to produce the
relaxed problem:

$$\min \sum_{i \in G} \sum_{j=1}^{J} C^k_{ij}\, u_{ijk} + \sum_{j=1}^{J} \lambda_j \Bigl( \sum_{i \in G} u_{ijk} - 1 \Bigr) + \sum_{i \in G} \mu_i \Bigl( \sum_{j=1}^{J} S^k_j\, u_{ijk} - B_i \Bigr) \tag{21a}$$

subject to

$$\sum_{i \in G} v_{ik} \le p_k, \tag{21b}$$

$$u_{ijk} \le v_{ik} \quad \text{for } j = 1, \cdots, J \text{ and } i \in G, \tag{21c}$$

$$u_{ijk},\, v_{ik} \in \{0, 1\} \quad \text{for } j = 1, \cdots, J \text{ and } i \in G. \tag{21d}$$


The relaxed problem can be solved by inspection. For each DC i,
compute

$$R_i = \sum_{j=1}^{J} \min(C^k_{ij} + \lambda_j + \mu_i S^k_j,\ 0). \tag{22}$$

This represents the contribution to the objective function (21a) that
is possible if $v_{ik} = 1$. For given $\lambda_j$ and $\mu_i$, it is optimal in (21) to choose
those DCs corresponding to the $p_k$ smallest values of $R_i$. (If $v_{ik} = 1$,
then it is optimal to choose $u_{ijk} = 1$ only if $C^k_{ij} + \lambda_j + \mu_i S^k_j < 0$.)

If these optimal values of $u_{ijk}$ satisfy constraints (20e) and (20c),
then we obtain a solution to the original problem (20). If not, then the
values of $\lambda_j$ and $\mu_i$ are modified for the next iteration. If $\sum_{i \in G} u_{ijk} > 1$,
then $\lambda_j$ is increased, whereas if $\sum_{i \in G} u_{ijk} < 1$, $\lambda_j$ is decreased. Likewise,
if $\sum_{j=1}^{J} S^k_j u_{ijk} > B_i$, then $\mu_i$ is increased. To avoid oscillations, we do
not allow values of $\mu_i$ to decrease. Thus, if $\sum_{j=1}^{J} S^k_j u_{ijk} \le B_i$, the
constraint is satisfied and $\mu_i$ is held fixed. Procedures for updating
these multipliers are analogous to those described in Section 3.3.
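
One iteration of this scheme might look like the following Python sketch. It solves the relaxed problem (21) by inspection using (22) and then adjusts the multipliers in the directions described above. The function names, the list-based data layout, and in particular the fixed step size are assumptions of this sketch; the actual update procedure is the one referred to in Section 3.3.

```python
def solve_relaxed(C, S, lam, mu, p):
    """Solve (21) by inspection for fixed multipliers.

    C[i][j] : cost of serving area j from DC i for the product at hand
    S[j]    : space required by area j;  lam[j], mu[i] : multipliers
    p       : number of DCs allowed to handle the product
    """
    n_dc, n_area = len(C), len(S)
    reduced = [[C[i][j] + lam[j] + mu[i] * S[j] for j in range(n_area)]
               for i in range(n_dc)]
    # R[i] is the best contribution of DC i if it is selected, as in (22).
    R = [sum(min(r, 0.0) for r in row) for row in reduced]
    chosen = sorted(range(n_dc), key=lambda i: R[i])[:p]
    u = [[0] * n_area for _ in range(n_dc)]
    for i in chosen:
        for j in range(n_area):
            if reduced[i][j] < 0:
                u[i][j] = 1
    return chosen, u, R


def update_multipliers(u, S, B, lam, mu, step):
    """Move lam toward satisfying (20c); raise mu where (20e) is violated."""
    n_dc, n_area = len(u), len(S)
    new_lam, new_mu = list(lam), list(mu)
    for j in range(n_area):
        served = sum(u[i][j] for i in range(n_dc))
        new_lam[j] += step * (served - 1)   # up if over-served, down if unserved
    for i in range(n_dc):
        load = sum(S[j] * u[i][j] for j in range(n_area))
        if load > B[i]:
            new_mu[i] += step * (load - B[i])  # mu is never decreased
    return new_lam, new_mu
```

A driver loop would alternate the two calls until the assignments satisfy (20c) and (20e) or an iteration limit is reached.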

If a feasible solution is not found within a reasonable number of
iterations (again, 40 has been used successfully), then a solution is
generated based on the last set of $R_i$ values. The DCs corresponding
to the $p_k$ smallest $R_i$ values are chosen to serve product k (i.e., for
these DCs set $v_{ik} = 1$); let $G_k$ denote this set. For each demand area j,
we find index e such that

$$C^k_{ej} + \mu_e S^k_j = \min_{i \in G_k} \{C^k_{ij} + \mu_i S^k_j\}, \tag{23}$$

where the $\mu_i$ are also taken from the last iteration. Then, for each j,
set

$$u_{ijk} = \begin{cases} 1 & \text{if } i = e \\ 0 & \text{otherwise} \end{cases} \quad \text{for } i \in G. \tag{24}$$


No immediate modification of this solution is made, even in the event
that this solution violates capacity constraints (20e). The solution will
be adjusted to feasibility later in the facility sizing routine (see Section
3.6). In the meantime, the amount of violation is small, since Type 4
products are assumed to constitute only a small portion of the demand.
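
The fallback rule of (23) and (24) can be sketched as below. This is a minimal illustration, assuming the same list-based layout as before; `fallback_assignment` and its argument names are hypothetical, and capacity is deliberately not checked, in keeping with the text.

```python
def fallback_assignment(C, S, mu, R, p):
    """Choose the p DCs with smallest R, then assign each area as in (23)-(24).

    Returns the chosen DC indices and a dict mapping area j to its DC e.
    Capacity constraints (20e) are not enforced here.
    """
    chosen = sorted(range(len(R)), key=lambda i: R[i])[:p]
    assign = {}
    for j in range(len(S)):
        # penalized cost uses the multipliers mu from the last iteration
        assign[j] = min(chosen, key=lambda i: C[i][j] + mu[i] * S[j])
    return chosen, assign
```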

The description of the algorithm above can be modified in a straight- 
forward way so that the special DCs are restricted to be chosen from 
among a predetermined list of DCs for each special product. If none 
of the facilities from the predetermined list were to appear in the 
solution obtained in the primary optimization routine, then the model 
would serve demand areas directly from the Type 4 product vendors. 
This possibility could be completely avoided by fixing one or more of 
the facilities on the list to be open a priori. 


3.5 LDC locations 


After a set of open DCs and demand area assignments has been chosen,
both for Type 4 products (Section 3.4) and other products (Section
3.3), we attempt to introduce LDCs into the solution. First, we determine
a tentative set of open LDC locations and accompanying demand
area assignments. With LDCs in place, utilized capacity in some DCs 
may now be decreased, thus allowing additional demand areas to be 
assigned. We then check if any “small” DCs can now be closed 
advantageously and their demand areas reassigned. 


3.5.1 Determining tentative LDC assignments 


In this routine, we first order the DCs in terms of decreasing 
throughput (i.e., amount of warehouse space occupied). For each DC 
in order, we perform an “LDC assignment procedure” to determine 
the set of LDCs that should be associated with the DC. We consider 
the larger DCs first, since they are more likely to serve a larger 
geographic region and, hence, more likely to benefit from the presence 
of subsidiary LDC locations. The following is the LDC assignment 
procedure for DC i:


Step 1. Let $Q_i$ denote the set of demand areas j that are assigned to
DC i. Let E denote the set of possible LDC locations that have not
already been assigned to another DC. For each $j \in Q_i$, determine the
LDC $e \in E$ that satisfies

$$A_{ije} = \min_{\ell \in E} A_{ij\ell}. \tag{25}$$

For each such $j \in Q_i$, in turn, make a tentative assignment of j to e if
$A_{ije} < A_{ij}$, unless the assignment would result in the violation of
capacity bound $\bar{B}^2_e$. If assigning j to a particular LDC e would violate
the capacity bound, check other demand areas that are already assigned
to e; if a feasible solution with lower total cost can be obtained
by removing another demand area from LDC e and replacing it with
demand area j, then do so. Let $P_\ell$ denote the set of demand areas j
tentatively assigned to LDC $\ell$ at the end of step 1.


Step 2. Consider those LDC locations to which demand areas $j \in Q_i$
have been tentatively assigned at the end of step 1. Sort these LDCs 
in order of increasing throughput. Thus, underutilized LDCs, which 
are less likely to be cost-effective, are considered first. 


Step 3. Consider each LDC $\ell$ on the list in order. If the throughput
due to tentative assignments is greater than some threshold amount
(say, some fraction of the minimum capacity $\underline{B}^2_\ell$), leave its assignments
unchanged. Otherwise, remove the LDC from the list, cancel the
tentative demand area assignments to LDC $\ell$, and attempt to reassign
the demand areas $j \in P_\ell$, if possible, to other LDCs. (That is, find
another LDC $\hat{e}$ such that

$$A_{ij\hat{e}} = \min_{\ell \ne e} A_{ij\ell}. \tag{26}$$

If $A_{ij\hat{e}} < A_{ij}$, tentatively assign j to $\hat{e}$.)

Step 4. Re-sort LDCs remaining on the list in order of increasing
throughput. 

Step 5. For each LDC $\ell$ remaining on the list, estimate the total
cost for serving demand areas $j \in P_\ell$ using DC i and LDC $\ell$. This
estimate includes the assignment costs $\sum_{j \in P_\ell} A_{ij\ell}$, plus the fixed setup
cost $\alpha^2_\ell$, plus variable setup costs $\beta^2_\ell \sum_{j \in P_\ell} S_j$, plus the concave
operating cost $f^2_\ell(\sum_{j \in P_\ell} S_j)$, and minus the closing cost $c^2_\ell$. Compare this
with an estimate of the cost for serving demand areas $j \in P_\ell$ using
DC i alone. This estimate includes the assignment costs $\sum_{j \in P_\ell} A_{ij}$, plus
variable setup costs $\beta^1_i \sum_{j \in P_\ell} S_j$, plus the difference in the concave
operating cost function due to the additional throughput. If the cost
using the LDC is less, then open LDC $\ell$ and make the tentative
assignments permanent. If not, then attempt to tentatively reassign
the demand areas $j \in P_\ell$ to other LDCs still on the list. □

Obviously, more sophisticated procedures can be designed to decide 
assignments for LDCs. However, since it is expected that demand 
areas will only be assigned to LDCs that are relatively proximate, the 
potential number of economically attractive assignments is limited. 
Thus, more sophisticated procedures have not been found necessary. 

After completing this procedure for each DC i that is open, we have
a tentative set of open LDCs and LDC assignments.
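
The cost comparison in step 5 of the LDC assignment procedure can be illustrated with a short Python sketch. All names here are hypothetical, and the interpretation of "the difference in the concave operating cost function" as the DC operating cost with and without the extra throughput is an assumption of this sketch.

```python
import math


def compare_ldc_costs(areas, a_dc, a_ldc, size, alpha2, beta2, f2, c2,
                      beta1, f1, dc_base_throughput):
    """Return (cost using the LDC, cost using the DC alone) for one LDC.

    a_ldc[j], a_dc[j] : assignment cost of area j via the LDC / via the DC alone
    size[j]           : throughput contributed by area j
    f2, f1            : concave operating cost functions of the LDC and the DC
    dc_base_throughput: DC throughput excluding these areas (assumed baseline)
    """
    s = sum(size[j] for j in areas)
    with_ldc = sum(a_ldc[j] for j in areas) + alpha2 + beta2 * s + f2(s) - c2
    dc_alone = (sum(a_dc[j] for j in areas) + beta1 * s
                + f1(dc_base_throughput + s) - f1(dc_base_throughput))
    return with_ldc, dc_alone


# Hypothetical numbers; square-root costs stand in for any concave function.
with_ldc, dc_alone = compare_ldc_costs(
    [0, 1], {0: 10, 1: 12}, {0: 4, 1: 5}, {0: 1, 1: 3},
    3, 0.5, lambda x: 2 * math.sqrt(x), 1,
    0.25, lambda x: 3 * math.sqrt(x), 12)
open_ldc = with_ldc < dc_alone
```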


3.5.2 Closing small DCs


We allow DCs with relatively small throughput to be closed and 
their demand areas reassigned. We first sort open DCs in a list in 
order of increasing throughput. In this way, the smaller DCs that are 
more likely to close will be considered first. For each DC i on the list, 
we then execute a “DC closing procedure,” which follows. 


Step 1. If the throughput of DC i exceeds some minimum threshold
(say, half the minimum capacity $\underline{B}^1_i$), then leave it as is; go on to the
next DC. If not, continue with step 2.


Step 2. For each $j \in Q_i$, attempt to tentatively reassign the demand
area to another DC. This reassignment should be to a DC that is
feasible; that is, the DC should have sufficient spare capacity to handle
the demand area, and the demand area should be within the radius of
operation for the DC. If there is more than one such feasible DC,
choose the one that minimizes the assignment cost for j. If, for some
$j \in Q_i$, no feasible reassignment is possible, then cancel all tentative
reassignments, keep DC i open, and go on to the next DC. Otherwise,
continue with step 3.


Step 3. Compute the additional assignment cost, operating cost, and
facility closing cost brought on by reassigning demand areas $j \in Q_i$.
Compare this with the savings in setup cost for DC i. If it is advantageous
to close DC i, make the tentative reassignments permanent.
Otherwise, cancel all tentative reassignments and go on to the next
DC. □
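
A compact sketch of the DC closing procedure follows. It is a simplified illustration under stated assumptions: the names and dictionary layout are hypothetical, the change in operating cost is omitted, and the closing cost and setup saving are passed in as single numbers.

```python
def try_close_dc(i, areas_of, cost, size, spare, reachable,
                 min_throughput, setup_saving, closing_cost):
    """Return {area: new DC} if closing DC i looks advantageous, else None.

    spare[d]  : spare capacity of each other open DC d
    reachable : set of (d, j) pairs with area j inside DC d's radius of operation
    """
    load = sum(size[j] for j in areas_of[i])
    if load > min_throughput:            # step 1: DC is large enough, keep it
        return None
    remaining = dict(spare)              # tentative capacities
    moves, extra = {}, closing_cost
    for j in areas_of[i]:                # step 2: cheapest feasible reassignment
        options = [d for d in remaining
                   if d != i and (d, j) in reachable and remaining[d] >= size[j]]
        if not options:
            return None                  # some area cannot move: keep DC i open
        best = min(options, key=lambda dc: cost[dc][j])
        moves[j] = best
        remaining[best] -= size[j]
        extra += cost[best][j] - cost[i][j]
    # step 3: close only if the added cost is below the setup saving
    return moves if extra < setup_saving else None
```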


At this point, the set of open DCs and LDCs is determined. There 
remains only the question of sizing these open facilities, which we 
address in the next subsection. 


3.6 Facility sizing 


The model must determine the number $q^1_i$ of increments installed
such that $\underline{B}^1_i + b^1_i q^1_i \le \bar{B}^1_i$. At this point, there is a tentative set of
demand area assignments that require amount $\sigma^1_i$ [see (8g)] of warehouse
space at each DC i. The simplest sizing routine would be to
choose $q^1_i$ to be the smallest integer such that $\sigma^1_i \le \underline{B}^1_i + b^1_i q^1_i$. There are
two reasons why we might want to modify this approach:


1. DC i is overcapacitated by an amount $W_i$ (perhaps because of
assignments that were made for Type 4 products).

2. By moving some demand areas to other DCs, we can perhaps
install one less increment of space, thereby saving variable setup cost
$\beta^1_i b^1_i$. Suppose that if we reduced the requirements for warehouse space
at DC i by $W_i$ units, we could install one less increment of space; call
this value $W_i$ the “excess” space requirement.

The facility sizing routine attempts to adjust demand area assignments
in order to allow certain facilities to be installed at a smaller
capacity. It begins by ordering the DCs in a list. First, any overcapacitated
DCs are entered in the list. Next, other DCs are entered in the
list in order of increasing $W_i$. For each DC on this list, in order, we
then perform the following “DC sizing routine.”


Step 1. For each demand area $j \in Q_i$, find its next best “feasible” DC
assignment and compute the associated assignment cost differential.
(By feasible we mean that assigning the demand area to the DC would
not cause certain capacity limits to be exceeded. If DC i, the facility
being sized, is overcapacitated, we take this limit to be the maximum
capacity of the other DC; otherwise, we take it to be the capacity size
required to serve the current assignments to the other DC. Assign an
arbitrarily large cost differential if no other feasible assignment is
possible.) At this point, consider only reassignments to a DC alone.
The use of LDCs will be considered in step 5 below.

Step 2. Sort demand areas $j \in Q_i$ in order of increasing cost
differential.

Step 3. Tentatively reassign a sufficient number of demand areas
from the top of the list so as to decrease the throughput at DC i by an
amount greater than or equal to the excess $W_i$.

Step 4. If DC i is overcapacitated, make permanent the reassignments
found in step 3. If not, check the cost differentials for the
reassignments. If the sum of the cost differentials for the reassigned
demand areas is greater than the savings $\beta^1_i b^1_i$ in variable setup cost,
then cancel the reassignments. Otherwise, make the reassignments
permanent.

Step 5. If demand area reassignments were made permanent in step
4, then consider the possible use of LDCs for each such demand area.
In particular, if demand area j were permanently reassigned to a
DC $i'$, examine those LDCs $\ell$ that are now associated with DC $i'$ (i.e.,
$z_{i'\ell} = 1$). Determine if total costs can be reduced by assigning demand
area j to one of these LDCs. If so, assign j to the LDC that results in
the minimum total cost. □
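
Steps 2 through 4 of the DC sizing routine might be sketched as follows. The function name and the tuple layout of `candidates` are assumptions of this sketch; step 1 (finding the next best feasible DC) and step 5 (LDC follow-up) are left outside it.

```python
def shed_excess(candidates, excess, overcapacitated, saving):
    """Return the demand areas to reassign away from the DC being sized.

    candidates : list of (cost_differential, size, area) from step 1, with
                 float('inf') as the differential when no feasible DC exists
    excess     : the amount W_i of throughput to remove
    saving     : variable setup cost saved by one fewer space increment
    """
    ordered = sorted(candidates)                 # step 2: cheapest moves first
    moved, freed, total_diff = [], 0.0, 0.0
    for diff, size, area in ordered:             # step 3: shed at least W_i
        if freed >= excess:
            break
        if diff == float("inf"):
            return []                            # not enough feasible moves
        moved.append(area)
        freed += size
        total_diff += diff
    if freed < excess:
        return []
    # step 4: always accept for an overcapacitated DC, else compare with saving
    if overcapacitated or total_diff <= saving:
        return moved
    return []
```

Returning an empty list corresponds to cancelling the tentative reassignments.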


After this procedure is completed, the values of $q^1_i$ (number of space
increments for DCs) and $q^2_\ell$ (number of space increments for LDCs)
are chosen to be the smallest integers that provide sufficient warehousing
space to handle the assigned demand areas.
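
The final choice of the number of increments is a ceiling calculation. A minimal sketch, assuming a facility described by a base capacity, an increment size, and a maximum capacity (all hypothetical parameter names):

```python
import math


def increments_needed(required, base, increment, maximum):
    """Smallest integer q with required <= base + increment * q."""
    q = max(0, math.ceil((required - base) / increment))
    if base + increment * q > maximum:
        raise ValueError("required space exceeds the maximum capacity")
    return q
```

For instance, a requirement of 130 units with a base of 100 and increments of 20 needs two increments.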


IV. IMPLEMENTATION DETAILS AND CONCLUDING REMARKS 


To provide an effective tool for decision makers, our model was 
designed to be flexible and efficient. Flexibility in the implementation 
allows the user to analyze many different real situations using the
same “generic” model. Further flexibility comes from an implementa- 
tion that permits some variables to be fixed a priori, thus allowing the 
user to impose various “nonquantifiable” conditions on the model. For 
example, various DCs or LDCs can be fixed open or closed. Certain 
DCs can be forbidden from handling Type 4 products; certain demand 
areas can be assigned a priori to a given DC or LDC. These conditions 
can be imposed to reflect some physical constraint or corporate policy. 
Such conditions can also be imposed after studying a previous solution 
obtained from the model. In this way, the model is “forced” to consider 
alternate solutions of interest to the user. The user may also wish to 
perform a sensitivity analysis in which the model is run several times 
with variations in one or more cost parameters. 

Efficiency of the algorithm is essential so that multiple runs as 
described above can be accomplished in a short time frame without 
excessive computation. The current implementation allows for a max- 
imum of 10 different products, 50 possible DC locations, 40 possible 
LDC locations, and 200 demand areas. Some average run times for a
variety of problems that have been encountered in practice are given
in Table I. (Problem I, in particular, was the basis for a nationwide
distribution planning study.) These times were obtained on an Amdahl
470/V8 operating under MVS; the code was compiled using the
FORTRAN 77 compiler. All times are within an acceptable range for
performing multiple runs in an economic study. Note that, as is typical 
in combinatorial optimization problems, run times can vary among 
different problems of the same size. For example, problem H is smaller 
than problem E, but took over twice the CPU time (262.2 seconds 
versus 115.6 seconds). (This variation is due primarily to the difference 
in the number of iterations required by the Kelly and Khumawala 
algorithm, which is used within the primary optimization procedure 
described in Section 3.3.) 

Table I—Average execution times

         Number of   Number   Number   Number of
         Demand      of       of       Special     CPU
Problem  Areas       DCs      LDCs     Products    Seconds
A         30          8       10       1             4.6
B         54          8       25       0           111
C        149          8       25       1            16.7
D         54         48       25       0            61.2
E        100         50       20       1           115.6
F        100         48       25       0           126.6
G        149         48       25       0           170.1
H        100         50        0       0           262.2
I        158         50        0       0           603.7

The many cost components considered (including nonlinear operating
costs), the ability to incorporate such features as different
product types and capacity bounds and subsidiary warehouses, the
flexibility offered by the ability to fix variables a priori, and the
efficiency in run times make this model a unique and useful tool. It
should provide genuine insight when used to study existing or proposed
material logistics systems.


V. ACKNOWLEDGMENTS 


Hanan Luss provided very valuable advice and support throughout 
the development of this model. His suggestions greatly improved the 
solution algorithm and its presentation here. Patty Donohue developed 
the user interface routines and provided essential help and support 
during the implementation and testing of the program. Dave Aff, Marv 
Berkowitz, and Steve Waters contributed to many helpful discussions 
concerning the model formulation and data requirements. 


REFERENCES 


1. R. L. Francis, L. F. McGinnis, and J. A. White, “Locational Analysis,” Eur. J. Oper. 
Res., 12 (March 1983), pp. 220-52. 

2. G. Y. Handler and P. B. Mirchandani, Location on Networks, Cambridge: M.I.T. 
Press, 1979. 

3. B. C. Tansel, R. L. Francis, and T. J. Lowe, “Location on Networks: A Survey, Part
I: The p-Center and p-Median Problems,” Manage. Sci., 29 (April 1983), pp. 482-97.

4. M. A. Efroymson and T. L. Ray, “A Branch-Bound Algorithm for Plant Location,”
Oper. Res., 14 (May-June 1966), pp. 361-8.

5. D. Erlenkotter, “A Dual-Based Procedure for Uncapacitated Facility Location,”
Oper. Res., 26 (November-December 1978), pp. 992-1009.

6. B. M. Khumawala, “An Efficient Branch-and-Bound Algorithm for the Warehouse
Location Problem,” Manage. Sci., 18 (August 1972), pp. 718-31.

7. A. A. Kuehn and M. J. Hamburger, “A Heuristic Program for Locating Warehouses,”
Manage. Sci., 9 (July 1963), pp. 643-66.

8. G. Cornuejols, G. L. Nemhauser, and L. A. Wolsey, “The Uncapacitated Facility
Location Problem,” Manage. Sci. Res. Rep. No. MSRR 493, Carnegie-Mellon
University, August 1983.

9. J. Krarup and P. Pruzan, “Simple Plant Location Problem: Survey and Synthesis,”
Eur. J. Oper. Res., 12 (January 1983), pp. 36-81.

10. G. T. Ross and R. M. Soland, “Modeling Facility Location Problems as Generalized 
Assignment Problems,” Manage. Sci., 24 (November 1977), pp. 345-57. 

11. M. L. Fisher, “The Lagrangean Relaxation Method for Solving Integer Programming 
Problems,” Manage. Sci., 27 (January 1981), pp. 1-8. 

12. P.S. Davis and T. L. Ray, “A Branch-Bound Algorithm for the Capacitated Facilities 
Location Problem,” Nav. Res. Logis. Quart., 16, (September 1969), pp. 331-4. 

13. A. M. Geoffrion and R. D. McBride, “Lagrangean Relaxation Applied to Capacitated
Facility Location Problems,” AIIE Trans., 10 (March 1978), pp. 40-7.

14. R. M. Nauss, “An Improved Algorithm for the Capacitated Facility Location 
Problem,” J. Oper. Res. Soc., 29 (December 1978), pp. 1195-201. 

15. D. L. Kelly and B. M. Khumawala, “Capacitated Warehouse Location With Concave 
Costs,” J. Oper. Res. Soc., 33 (September 1982), pp. 817-26. 

16. B. M. Khumawala and D. L. Kelly, “Warehouse Location With Concave Costs,” 
INFOR, 12 (February 1974), pp. 55-65. 

17. R. M. Soland, “Optimal Plant Location With Concave Costs,” Oper. Res., 22 
(March-April 1974), pp. 373-85. 

18. A. M. Geoffrion and G. W. Graves, “Multicommodity Distribution System Design 
by Benders Decomposition,” Manage. Sci., 20 (January 1974), pp. 822-44. 

19. D. Erlenkotter, “A Comparative Study of Approaches to Dynamic Location Problems,”
Eur. J. Oper. Res., 6 (February 1981), pp. 133-43.

20. H. Luss, “Operations Research and Capacity Expansion Problems: A Survey,” Oper.
Res., 30 (September-October 1982), pp. 907-47.

21. C. L. Monma and M. Segal, “A Primal Algorithm for Finding Minimum-Cost Flows
in Capacitated Networks With Applications,” B.S.T.J., 61 (July-August 1982),
pp. 949-68.

22. M. Held, P. Wolfe, and H. Crowder, “Validation of Subgradient Optimization,” 
Math. Program., 6 (January 1974), pp. 62-88. 

23. H. M. Salkin, Integer Programming, Reading, MA: Addison-Wesley, 1975. 

24. S. C. Narula, U. I. Ogbu, and H. M. Samuelsson, “An Algorithm for the p-Median 
Problem,” Oper. Res., 25 (July-August 1977), pp. 709-12. 

25. S. L. Hakimi, “Optimal Locations of Switching Centers and the Absolute Centers 
and Medians of a Graph,” Oper. Res., 12 (May-June 1964), pp. 450-9. 


AUTHOR 


John G. Klincewicz, S.B. (Mathematics), 1975, The Massachusetts Institute 
of Technology; M.A., 1977, and Ph.D. (Operations Research), 1979, Yale 
University; AT&T Bell Laboratories, 1979—. At AT&T Bell Laboratories, 
Mr. Klincewicz is a member of the Operations Research Department. His 
research interests include applications of mathematical programming and the 
development of algorithms for network flow problems and facility location 
problems. Member, Operations Research Society, Mathematical Programming 
Society. 




AT&T Technical Journal 
Vol. 64, No. 7, September 1985 
Printed in U.S.A. 


A Priority-Based Admission Scheme for a 
Multiclass Queueing System 


By K. M. REGE and B. SENGUPTA* 
(Manuscript received November 21, 1984) 


We consider a queueing problem involving multiple priority classes where 
the station is divided into waiting and service areas. The service area has a 
finite number of positions where a customer of a particular class has access to 
only a subset of these positions. The admission into the service area is 
controlled by a mechanism that allows customers within a priority class to 
enter the service area on a first-come first-served basis. The customers of 
different classes are assumed to be indistinguishable once they have entered 
the service area. We consider service under three different disciplines: last- 
come first-served preemptive resume, multiple server, and processor sharing. 
We show that the waiting time of a customer is related to that of a customer 
in an equivalent M/G/1 queue. We characterize the Laplace-Stieltjes trans- 
form of the time spent in the service area. We also discuss three potential 
applications in the area of computer and communication systems. 


I. INTRODUCTION


This paper is concerned with investigating a queueing system in 
which customers from n different job classes representing various 
priority levels receive service in a service area with a finite capacity of 
m. The capacity, m, of the service area refers to the maximum number 
of customers that can be present in the service area at any time. We 
describe an admission scheme that allows preferential access to the 
service area by the higher-priority customers. This scheme may give 
rise to a smaller waiting time before entry into the service area for the 
higher-priority customers. The customer classes are assumed to be
indistinguishable after they have gained access to the service area. 
The performance measures that we characterize are the distributions 
of waiting time and the mean time spent in the service area. This 
work has potential applications in the design of multiprogramming 
levels for computer systems and the design of window levels for 
communication systems. 

More formally, let n independent classes of customers arrive at a 
queueing station, each according to a Poisson process. Let the mean 
arrival rate of class i be $\lambda_i$. The classes are arranged according to
decreasing order of priority, that is, class 1 has the highest priority 
and class n has the lowest priority. The queueing station is divided 
into n waiting areas (one for each class) and a service area. The service 
area can hold at most m customers simultaneously. Service is provided 
at a state-dependent rate of $\mu_i$ whenever there are i customers present
in the service area. We consider three service disciplines within the 
service area: Last-Come First-Served Preemptive Resume (LCFS-PR), 
m Server (MS), and Processor Sharing (PS). The admission into the 
service area is controlled by means of a gate that allows customers 
from the waiting areas to enter the service area on the basis of the 
contents of the service area. The admission policy gives preferential 
treatment to higher-priority customers in gaining access to the service 
area. In particular, the admission policy is governed by two rules: 

1. When a customer of class i is admitted to the service area, the
waiting areas of classes $1, \cdots, i - 1$ must be empty.

2. When a class i customer enters the service area, the number of
customers in the service area (excluding itself) must be less than $k_i$.
The sequence $\{k_i,\ i = 0, \cdots, n + 1\}$ is a set of strictly decreasing,
nonnegative integers with $k_0 = \infty$, $k_1 = m$, and $k_{n+1} = 0$.

This admission scheme reserves some slots in the service area for the 
exclusive use of the high-priority customers. In the case of the MS 
model, this means that at times the low-priority customers will not be 
allowed to enter the service area although some servers are idle. 
Obviously this is not the most efficient way of utilizing the servers’ 
capacity. However, when the designer’s overriding concern is to reduce 
the delays suffered by the high-priority customers, it is useful to 
reserve certain slots exclusively for the high-priority customers. The 
queueing station is shown schematically in Fig. 1. 
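
The two admission rules can be expressed as a single predicate. The following Python sketch is illustrative only; the function name and the dictionary and list layout are assumptions, with the thresholds stored as `k[0], ..., k[n+1]` so that `k[i]` matches $k_i$ in the text.

```python
import math


def can_admit(cls, waiting, in_service, k):
    """Decide whether a class-`cls` customer may enter the service area now.

    waiting[c] : number of class-c customers in waiting area c (c = 1..n)
    in_service : customers currently in the service area (excluding this one)
    k          : thresholds with k[0] = inf, k[1] = m, k[n+1] = 0
    """
    higher_empty = all(waiting[c] == 0 for c in range(1, cls))  # rule 1
    return higher_empty and in_service < k[cls]                 # rule 2


# Hypothetical example: n = 3 classes, m = 5 positions.
thresholds = [math.inf, 5, 3, 1, 0]
```

With these thresholds, class 3 can enter only an empty service area, class 2 needs fewer than three customers present, and two of the five positions are effectively reserved for class 1.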

In this paper, we show that this problem has an interesting structure,
which can be exploited to characterize (1) the waiting-time distribution,
by class, and (2) the mean time spent in the service area, by
class. For the special case of the PS discipline, we show how to obtain
this mean when n equals two. Since writing this paper, it has been
brought to our attention that Schaack and Larson¹ have independently
studied the special case of the MS discipline and reported the same
results as we do here.

Fig. 1—The queueing station.

The remainder of this paper is organized into four sections. In Section II,
we discuss some potential applications for this model. In Section III, we
derive the waiting-time distribution, by class. We characterize the mean time
spent in the service area, by class, in Section IV. Section V summarizes
our conclusions.


II. POTENTIAL APPLICATIONS


We discuss three potential applications in this section. In the first, 
we propose this scheme for sharing of multiprogramming threads by 
several job classes in a computer system. Our second and third exam- 
ples propose this admission scheme for sharing a window size by 
several job classes at the link level and the application level, respec- 
tively, in a communication system. 

Avi-Itzhak and Heyman² first proposed that a state-dependent
server be used to approximate the CPU and disk subsystem of a
computer. We depict a multiple CPU and disk subsystem in Fig. 2,
which is approximated by a state-dependent server in our model. The
service rate $\mu_i$ of our model is obtained by solving for the throughput
in the closed queueing network of Fig. 2 with a population size of i. It
is assumed in our model that the service requirements in terms of
CPU and I/O times are approximately the same for all classes of
customers. Also, the service discipline for the state-dependent server
that is most appropriate for this application is PS. The closed queueing
network of Fig. 2 can be solved by the mean-value analysis described by
Reiser and Lavenberg.³ The circumstances under which a single
state-dependent server is a good approximation of the CPU and I/O
subsystem were investigated by Fredericks.⁴ This approximation is usually
good when each customer makes many trips to the I/O devices and
when the CPU and I/O times required by a customer are not too
unbalanced. In our model, m is the multiprogramming level of the
computer, usually determined by considerations such as available
memory and the extent to which the jobs require concurrent access to
the same databases. Given m, our model can be used to determine a
way to allocate available multiprogramming threads to the various job
classes so that some requirements on mean response time can be met.

Fig. 2—A computer with shared multiprogramming threads.

A second application would be in the modeling of a link layer
protocol such as high-level data link control. Assume that a provider
of packet-switching service offers n grades of service, each with its
own response-time requirement. The different grades of service may
be provided by appropriately sharing a link-level window size of m
among the packets of n service grades. The admission scheme proposed
in the Introduction is one means of offering different levels of service
to customers. In this application, the queueing station represents a
node in the network where the customers in the service area correspond
to the jobs ready for transmission. Service provided to a customer
constitutes its transmission to an adjacent node and the return of the
acknowledgment. Since data links are often characterized by relatively
low utilizations, the value of $\mu_i$ may be approximately proportional to
i, at least for small values of i. The constant of proportionality may
be taken as the mean round-trip time to receive an acknowledgment
on a link that has no traffic. This linearity would imply that there is
hardly any wait for transmission to commence once a packet has
entered the service area. For larger values of i, some saturation of $\mu_i$
will take place as the presence of a large number of packets in the
service area starts to choke the capacity of the link. The limiting value
of $\mu_i$ may be chosen as the rate at which acknowledgments can be
returned in a fully utilized link. We show the closed queueing network
used for calculating $\mu_i$ in Fig. 3. This is an approximation along the
lines of one proposed by Schwartz.⁵

The third potential application is from the point of view of a user
of a data network. This potential application is similar in spirit to the
previous one except that we are concerned with the high-level protocol
of host-to-host traffic using a data network. The network itself is
approximated by a state-dependent server. The closed queueing network
used for calculating $\mu_i$ is shown in Fig. 4. This approximation
was proposed by Reiser.⁶,⁷ The user of the network must allocate a
window size of m to n different types of traffic. The admission scheme
described earlier may enable the user to determine a way to share the
window size among the types of traffic to meet certain response-time
criteria.

Fig. 3—Link layer protocol. [Diagram labels: window size = m; acknowledgments.]

Fig. 4—High-level protocol.


III. THE WAITING-TIME DISTRIBUTION


In this section, we will characterize the waiting-time distributions 
without assuming anything about the service discipline. The key result 
of this section is that, given that a customer of class $l$ is required to
wait, its waiting-time distribution is related to that of a suitable 
M/G/1 queue. 

The state of the system is completely specified by an n-tuple
$(J_1, J_2, \ldots, J_l, \ldots, J_n)$, where $J_i$ ($i = 2, \ldots, n$) represents the
number of customers of class $i$ waiting and $J_1$ represents the number of
customers of class 1 waiting plus the number of customers present in the
service area. With this description of the state space, one can, in
principle, write down a system of equations for the steady-state
probability vector $P(\mathbf{j}) = P(J_1 = j_1, J_2 = j_2, \ldots, J_n = j_n)$. In particular,
for $l = 0, \ldots, n - 1$,


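$$(\Lambda + \mu_{j_1})\,P(\mathbf{j}) = \sum_{k=l+1}^{n} \lambda_k P(\mathbf{j} - \mathbf{e}_k)\,\delta(j_k) + \sum_{k=1}^{l} \lambda_k P(\mathbf{j} - \mathbf{e}_1)\,\delta(j_1) + \mu_{j_1+1} P(\mathbf{j} + \mathbf{e}_1) + \mu_{j_1} P(\mathbf{j} + \mathbf{e}_{l+1})\,(1 - \delta(j_1 - k_{l+1}))\,\delta(l), \tag{1}$$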


where

$$k_{l+1} \le j_1 < k_l, \qquad \Lambda = \sum_{k=1}^{n} \lambda_k,$$

$\delta(x) = 1$ if $x > 0$ and 0 otherwise, and $\mathbf{e}_k$ is an n-tuple with a 1 in the
kth position and 0 elsewhere. For $j_1 \ge m$, $\mu_{j_1}$ is equal to $\mu_m$ ($= \mu$),
since for $j_1 \ge m$, the number of customers in the service area is m. The
solution of this equation is not easy; however, it is possible to solve it
for the case where n = 2. Since the solution of this equation does not
concern us at present, we show how to calculate this for n = 2 in
Appendix A.

We will now concentrate our attention on the stochastic process
defined by the random variable $J_1$. Let $\{u_j\}$ be the steady-state
marginal probability distribution of $J_1$. Since arrivals are Poisson, a
customer of class $l$ is required to wait outside with probability
$\sum_{j \ge k_l} u_j$. Clearly, the waiting time is 0 whenever an arrival finds that
$J_1 < k_l$. Let us now start observing the system when $J_1$ changes its
value from $k_l - 1$ to $k_l$. Let $t_0$ denote this instant. At $t_0$, a customer of
type $j$, with $j \le l$, arrives to find exactly $k_l - 1$ customers in the service
area and is immediately admitted for service. Let $t_f$ denote the first
instant after $t_0$ when $J_1$ moves from $k_l$ to $k_l - 1$ with no type $l$ customers
waiting outside. During the open interval $(t_0, t_f)$, several type $l$
customers may get admitted to the service area. If $n$ ($n \ge 0$) type $l$
customers are admitted to the service area during the open interval
$(t_0, t_f)$, let $t_1, t_2, \ldots, t_n$ denote instants when these admissions took
place (refer to Fig. 5). Note that at the instants $t_1, t_2, \ldots, t_n$, a
departure occurs from the state $J_1 = k_l$ and there is at least one type $l$
customer waiting outside. Also, at these instants there cannot be any
higher-priority customers in the waiting area. Now let us focus our
attention on the intervals $(t_0, t_1), (t_1, t_2), \ldots, (t_{n-1}, t_n)$ and $(t_n, t_f)$. [In
case $n$ equals 0, we need consider just one interval $(t_0, t_f)$.] The lengths
of these intervals are governed by the customers inside the service
area, which by assumption are indistinguishable, and by arrivals of
customers of priority higher than that of type $l$ customers (i.e., the
types of these customers are less than $l$) that are Poisson. Also, since
the service requirements are memoryless and since the end points of
these intervals are marked by identical states so far as customers of
priority higher than that of type $l$ customers are concerned, the lengths
of these intervals are independent and identically distributed (i.i.d.)
random variables. Let $H_l$ denote the generic random variable
corresponding to these lengths and let $H_l(s)$ denote the Laplace-Stieltjes
Transform (LST) of its distribution function.

Fig. 5. A typical sample function of the process $J_1$. (The notches at $t_1$ and $t_2$ represent
the event that a departure occurred from the state $J_1 = k_l$ and a waiting type $l$ customer
was immediately admitted to the service area.)

Now a type $l$ customer is required to wait if it arrives during an open
interval similar to the interval $(t_0, t_f)$ described in the previous
paragraph. Since the interadmission times for type $l$ customers during such
an interval are i.i.d. random variables with LST of distribution
function $H_l(s)$, it follows that the waiting-time distribution of a type $l$
customer, given that it is required to wait, is identical to that of a
customer in an M/G/1 queue [with arrival rate $\lambda_l$ and LST of service-time
distribution $H_l(s)$], which arrives to find the server busy. Let
$W_l(s)$ denote the LST of the waiting-time distribution of a type $l$
customer given that it is required to wait. Then, from page 223 of Ref.
8, we have


$$W_l(s) = \frac{(1 - H_l(s))(1 - \rho_l)}{E(H_l)\,[s - \lambda_l + \lambda_l H_l(s)]}, \tag{2}$$

where

$$\rho_l = \lambda_l E(H_l)$$

and $l = 2, \ldots, n$.
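
As a sanity check on the form of eq. (2), the following minimal sketch evaluates the conditional waiting-time LST for a generic service-time LST. The function name and the exponential test case are illustrative assumptions, not part of the original analysis; with exponential service at rate mu, the result should reduce to an exponential waiting time with rate mu - lambda.

```python
def conditional_wait_lst(s, lam, service_lst, mean_service):
    """LST of the waiting time of an M/G/1 customer who finds the server busy.

    Implements the form of eq. (2); valid for s > 0 (at s = 0 it is 0/0).
    `service_lst` plays the role of H_l(s) and `mean_service` of E(H_l).
    """
    rho = lam * mean_service
    if not 0.0 < rho < 1.0:
        raise ValueError("need 0 < rho < 1 for a stable queue")
    h = service_lst(s)
    return (1.0 - h) * (1.0 - rho) / (mean_service * (s - lam + lam * h))


# Hypothetical check: exponential "service" with rate mu.
mu, lam, s = 1.0, 0.4, 0.7
value = conditional_wait_lst(s, lam, lambda x: mu / (mu + x), 1.0 / mu)
expected = (mu - lam) / (mu - lam + s)  # exponential wait with rate mu - lam
```

In this special case the two quantities agree, which is the same functional form as the class 1 result stated later in eq. (4).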


The LST of the unconditional waiting-time distribution for a customer
of type $l$ is given by

$$\sum_{j=0}^{k_l - 1} u_j + W_l(s) \sum_{j=k_l}^{\infty} u_j. \tag{3}$$


The waiting-time distribution of a class 1 customer is easier to
characterize. Given that a class 1 customer has to wait, its waiting
time is the same as the sojourn time in an M/M/1 queue where the
server is working at a rate of $\mu_m$ ($= \mu$, say). So,

$$W_1(s) = \frac{\mu - \lambda_1}{\mu - \lambda_1 + s}, \tag{4}$$

and the LST of the unconditional waiting-time distribution is given by

$$\sum_{j=0}^{k_1 - 1} u_j + W_1(s) \sum_{j=k_1}^{\infty} u_j. \tag{5}$$


1738 TECHNICAL JOURNAL, SEPTEMBER 1985 


In the remainder of this section, we will show how to calculate $u_j$
and $H_l(s)$. Let us define a random variable $B_l$ to be the elapsed time
from the instant $J_1$ changes from $k_l - 1$ to $k_l$ until the next instant
when the value of $J_1$ drops from $k_l$ to $k_l - 1$ and $J_l = 0$. In other words,
$B_l$ is the length of an interval similar to $(t_0, t_f)$ discussed earlier. From
the preceding discussion it should be clear that $B_l$ constitutes a busy
period of an M/G/1 queue with arrival rate $\lambda_l$ and the LST of service-time
distribution $H_l(s)$. Let $B_l(s)$ be the LST of the distribution of $B_l$.
Then $B_l(s)$ and $H_l(s)$ are related by (see page 212 of Ref. 8)


$$B_l(s) = H_l[s + \lambda_l - \lambda_l B_l(s)]. \tag{6}$$

Since $B_1(s)$ represents the busy period of an M/M/1 queue,

$$B_1(s) = \frac{\mu + \lambda_1 + s - [(\mu + \lambda_1 + s)^2 - 4\lambda_1\mu]^{1/2}}{2\lambda_1} \tag{7}$$

from page 215 of Ref. 8.

Next, we define the random variable $C_j$ to denote the first passage
time from the state $J_1 = j$ to $J_1 = j - 1$, where $k_{l-1} > j > k_l$. Let $C_j(s)$
be the LST of the distribution of $C_j$. It should be clear that the waiting
customers of class $l, \ldots, n$ play no role in determining this first
passage time. Further, $J_1 = j$ implies that $J_2 = J_3 = \cdots = J_{l-1} = 0$.
From these observations, now it is possible to write down the following
equations for $C_j(s)$:


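$$C_j(s) = \frac{1}{\Lambda_l + \mu_j + s}\,\bigl(\Lambda_l C_{j+1}(s)\,C_j(s) + \mu_j\bigr) \quad \text{for } j = k_l + 1, \ldots, k_{l-1} - 2;$$

$$C_{k_{l-1}-1}(s) = \frac{1}{\Lambda_l + \mu_{k_{l-1}-1} + s}\,\bigl(\Lambda_l B_{l-1}(s)\,C_{k_{l-1}-1}(s) + \mu_{k_{l-1}-1}\bigr);$$

$$H_l(s) = \frac{1}{\Lambda_l + \mu_{k_l} + s}\,\bigl(\Lambda_l C_{k_l+1}(s)\,H_l(s) + \mu_{k_l}\bigr);$$

and

$$\Lambda_l = \sum_{k=1}^{l-1} \lambda_k \quad \text{for } l = 2, \ldots, n. \tag{8}$$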


Thus, eq. (8) defines a recursive technique for obtaining $H_l(s)$ from
$B_{l-1}(s)$ via the functions $C_j(s)$ for $k_{l-1} > j > k_l$. By using eq. (6), one
can obtain $B_l(s)$ from $H_l(s)$; and eq. (7) provides the value of $B_1(s)$,
the boundary for eq. (8) when $l = 2$.
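
The recursion just described can be sketched numerically for the two-class case ($l = 2$). This is an illustrative sketch only: the function names, the list `mu_rates` of state-dependent service rates, and the assumption that `mu_rates[k1]` equals the saturated rate $\mu$ are choices made here, not notation from the paper. Each implicit relation in eq. (8) is solved for its unknown, giving $C_j = \mu_j / (\Lambda_l + \mu_j + s - \Lambda_l C_{j+1})$.

```python
import math


def busy_period_lst(s, lam1, mu):
    """Eq. (7): LST of the M/M/1 busy period B_1(s)."""
    a = mu + lam1 + s
    return (a - math.sqrt(a * a - 4.0 * lam1 * mu)) / (2.0 * lam1)


def inter_admission_lst(s, lam1, mu_rates, k2, k1):
    """H_2(s) from B_1(s) via C_j(s), k1 > j > k2, following eq. (8) with l = 2.

    mu_rates[j] is the assumed service rate with j customers in the service
    area; mu_rates[k1] is taken to be the saturated rate mu.
    """
    if not 1 <= k2 < k1 < len(mu_rates):
        raise ValueError("need 1 <= k2 < k1 < len(mu_rates)")
    big_lambda = lam1  # Lambda_2 = lambda_1
    upper = busy_period_lst(s, lam1, mu_rates[k1])  # boundary value B_1(s)
    for j in range(k1 - 1, k2, -1):  # j = k1 - 1, ..., k2 + 1
        upper = mu_rates[j] / (big_lambda + mu_rates[j] + s - big_lambda * upper)
    # `upper` is now C_{k2+1}(s), or B_1(s) when k1 == k2 + 1.
    return mu_rates[k2] / (big_lambda + mu_rates[k2] + s - big_lambda * upper)


mu_rates = [0.0, 0.4, 0.7, 0.9, 1.0]  # hypothetical rates, m = k1 = 4
h_at_zero = inter_admission_lst(0.0, 0.3, mu_rates, k2=2, k1=4)
h_at_one = inter_admission_lst(1.0, 0.3, mu_rates, k2=2, k1=4)
b = busy_period_lst(0.5, 0.3, 1.0)
```

Because $\lambda_1 < \mu$ in this example, every transform evaluates to 1 at $s = 0$, and `b` satisfies the functional equation (6) with $H_1(s) = \mu/(\mu + s)$.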

Finally, we show how to characterize $u_j$ for $j = 0, 1, \ldots$, which is the
steady-state marginal distribution of $J_1$. To do this, we define n
Semi-Markov Processes (SMP), where the $l$th SMP has states $k_l, \ldots, k_{l-1}$,
where $l = 2, \ldots, n + 1$. The state of the $l$th SMP is the realization of
the random variable $J_1$ with two differences. When $J_1 > k_{l-1}$, we will
assume that the state of the SMP is $k_{l-1}$. Further, we will simply
ignore the times when $J_1 < k_l$. A typical sample function of the
$l$th SMP shown in Fig. 6 may help illustrate the structure of these
SMPs. The transition probability for the SMP from state $j$ to $j + 1$ is
$\Lambda_l/(\Lambda_l + \mu_j)$ and $j$ to $j - 1$ is $\mu_j/(\Lambda_l + \mu_j)$, where $k_l < j < k_{l-1}$. The
holding time in state $j$ ($k_l < j < k_{l-1}$) is exponential with rate $\Lambda_l + \mu_j$.
For state $k_{l-1}$, the holding time is $B_{l-1}$ and this state makes a transition
into state $k_{l-1} - 1$ with probability 1. State $k_l$ makes a transition into
$k_l + 1$ with probability 1, and the holding time is exponential with rate
$\Lambda_l$. In the description of the SMP, only one statement needs
clarification, that is, the holding time in state $k_l$. Since we are ignoring all
times when $J_1 < k_l$, this is equivalent to ignoring the transition of $J_1$
from $k_l$ to $k_l - 1$. The result follows from observing that the transition
from $k_l$ to $k_l + 1$ occurs at an exponential rate of $\Lambda_l$. It is relatively
easy to solve these n SMPs using methods described in Ross.’ Let $\pi_j^{l}$
be the steady-state probability of state $j$ in the $l$th SMP ($l = 2, \ldots,
n + 1$; $j = k_l, \ldots, k_{l-1}$). Then it should be clear that

Fig. 6. Sample function of the $l$th SMP derived from the sample function of $J_1$
shown in Fig. 5. (Diagram not reproduced; the intervals during which $J_1 < k_l$ are marked IGNORED.)


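$$P(J_1 = j \mid J_1 \ge k_l) = u_j \Big/ \sum_{i=k_l}^{\infty} u_i = \pi_j^{l} \quad \text{for } j = k_l, \ldots, k_{l-1} - 1 \tag{9}$$

and

$$P(J_1 \ge k_{l-1} \mid J_1 \ge k_l) = \sum_{i=k_{l-1}}^{\infty} u_i \Big/ \sum_{i=k_l}^{\infty} u_i = \pi_{k_{l-1}}^{l}.$$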
It is easy to use (9) to calculate $u_j$ ($j = 0, \ldots, m - 1$) recursively,
starting with the solution of the $(n + 1)$st SMP and working backwards
through to the second SMP. For $j \ge m$, the server always works at a
rate of $\mu_m$ ($= \mu$), and the random variable $J_1$ behaves like the number
in the system for an M/M/1 queue. Thus, we have

$$u_j = (1 - \lambda_1/\mu)\,(\lambda_1/\mu)^{j-m} \left(1 - \sum_{i=0}^{m-1} u_i\right)$$

for $j \ge m$. We note at this point that the probability that the system
is idle is given by $u_0$.
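
The geometric tail for $j \ge m$ is simple to evaluate once the head probabilities are known. The sketch below assumes the head $u_0, \ldots, u_{m-1}$ has already been obtained from the SMP recursion; the numerical values used are hypothetical placeholders, not results from the paper.

```python
def tail_probabilities(head, lam1, mu, count):
    """u_j for j = m, ..., m + count - 1, given head = [u_0, ..., u_{m-1}]."""
    rho = lam1 / mu
    if not 0.0 <= rho < 1.0:
        raise ValueError("need lam1 < mu for the tail to be summable")
    mass = 1.0 - sum(head)  # probability that J_1 >= m
    return [(1.0 - rho) * rho ** n * mass for n in range(count)]


head = [0.30, 0.25, 0.20]  # hypothetical u_0, u_1, u_2 (so m = 3)
tail = tail_probabilities(head, lam1=0.5, mu=1.0, count=60)
total = sum(head) + sum(tail)
```

With enough tail terms, `total` is numerically 1, which confirms that the tail formula distributes exactly the mass left over by the head.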


IV. THE TIME SPENT IN THE SERVICE AREA 


In this section, we describe methods of obtaining the mean time 
spent in the service area by class for the LCFS-PR and MS disciplines. 
For the PS discipline, we describe a method for characterizing the 
means when n = 2. 


4.1 The LCFS-PR discipline 


In this discipline, we will assume that on entry into the service area, 
a customer occupies the lowest-numbered service position that is 
empty. Further, the server renders service to the customer in the 
highest-numbered service position that is nonempty. Thus, a customer 
occupies the same service position from entry until departure. Let
$T_j(s)$ be the LST of the distribution of time spent in the service area
by a customer who occupies position $j$ on entry into the service area.
Then,

$$T_{k_1}(s) = \mu/(\mu + s)$$

and

$$T_j(s) = \begin{cases} C_j(s) & \text{if } k_{l-1} > j > k_l \\ H_l(s) & \text{if } j = k_l \end{cases} \qquad \text{and } l = 2, \ldots, n. \tag{10}$$

It is now easy to use the results of Section III to obtain this LST or
any other characterization of this distribution.


4.2 The multiple-server discipline 


In this discipline, let $\mu_i = i\sigma$. Then the time spent in the service area
is simply exponential with parameter $\sigma$.


4.3 The processor sharing discipline 


The exact solution for the mean time spent in the service area can
be obtained by first noting that customer classes are indistinguishable
on entry into the service area. We denote the state of the system by
an n-tuple $(j_1, j_2, \ldots, j_n)$ as seen by a customer of class $l$ after entry
into the service area and let $Q_l(\mathbf{j})$ denote the probability that the state
of the system is $\mathbf{j}$ when an arbitrary class $l$ customer is admitted to
the service area, where $j_1$ includes the newly admitted customer.
Further, let $x(\mathbf{j})$ be the mean time spent in the service area for a
customer who sees state $\mathbf{j}$ immediately on being admitted to the service
area. If we let $t_l$ be the mean time spent in the service area by a class
$l$ customer, then

$$t_l = \sum_{\mathbf{j} \in A_l} x(\mathbf{j})\,Q_l(\mathbf{j}), \tag{11}$$

where

$$A_l = \{(j_1, j_2, \ldots, j_n) \mid j_k = 0 \text{ for } 2 \le k < l\} \quad \text{for } l = 1, \ldots, n.$$


It is possible to obtain $Q_l(\mathbf{j})$ from the following observations:

1. For $j_1 < k_l$,

$$Q_l(\mathbf{j}) = P(\mathbf{j} - \mathbf{e}_1).$$

For a customer of class $l$ to see $j_1$ including itself, there must be $j_1 - 1$
ahead of it in the service area.

2. Whenever a customer of class $l$ enters the service area,
$J_2 = J_3 = \cdots = J_{l-1} = 0$ and $J_1 \le k_l$.

3. For $j_1 = k_l$, one of two disjoint events must occur. Either the class
$l$ customer arrived to see $k_l - 1$ customers ahead of it in the service
area or it must have waited in the waiting area prior to admission.
The former case is identical to the first observation above. In the
latter case, we have to characterize the distribution of $(J_l, \ldots, J_n)$ at
the time of entry into the service area. The random variable $J_l$ behaves
like the number waiting as seen by a customer about to enter service
given that it had to wait in an M/G/1 queue with an arrival rate of $\lambda_l$
and a service time of $H_l$. The distribution of $J_k$ ($k = l + 1, \ldots, n$) is
simply the convolution of what was seen on arrival and the number of
new arrivals of type $k$ during the wait of the customer of class $l$.

In principle, it is possible to write down $Q_l(\mathbf{j})$ in terms of $P(\mathbf{j})$ from
the observations made above. The notation is cumbersome, so we will
not go into the details here. The exact derivation when n = 2 is given
in Appendix B.

Further, for $k_{l+1} \le j_1 < k_l$, the $x(\mathbf{j})$ satisfy


n l 


Anx(j + en) + DY Agx(j + e1) 
i 


=141 k=1 


(A + yj,)xQ) = 1+ P 





- Ge ) (jr) oy, (x — e@,)6(J1 — Rui) + [xQG — e1)(1 — 6(Ju1)) 


+ xj — e141)6(Jrrr) 1 — 6(1 — Ris1))}. (12) 


The solution of this equation is not easy; however, it is possible
to solve it for the case where n = 2. We present this solution in
Appendix C.


V. CONCLUDING REMARKS


In the earlier sections, we have shown how to characterize distri- 
butions of the waiting time and the time spent in the service area. Of 
interest in many applications would also be the sojourn time (i.e., the 
elapsed time between arrival into and departure from the system) of 
customers. In principle, it is possible to characterize the sojourn time 
distribution for the two-class PS problem by the methods used in Ref. 
10. 

We note that the time spent in the service area for the first-come
first-served discipline is somewhat difficult to characterize, whereas
the result for the waiting time is the same as that in Section III. The
reason for this difficulty can be seen by first assuming that we are
about to characterize a two-class problem. Then the mean time spent
in the service area has to be found conditioned on $J_1$, $J_2$ and the
position of the tagged customer in the service area. The difference
equations for this mean time thus will be in three variables and are
hard to solve.


APPENDIX A
The Steady-State Probabilities for a Two-Class Problem

In this appendix, we show how to obtain the solution to eq. (1) when
n = 2. For the sake of notational ease, we will refer to the random
variables $J_1$ and $J_2$ as $I$ and $J$ in this and the other appendices. Further,
$i$ and $j$ will be the realizations of the random variables $I$ and $J$,
respectively. Let $P_{i,j}$ be the steady-state probability that $I = i$ and
$J = j$. Equation (1) reduces to the following equations when n is 2:

$$(\lambda_1 + \lambda_2 + \mu_i) P_{i,j} = \lambda_1 P_{i-1,j} + \lambda_2 P_{i,j-1} + \mu_{i+1} P_{i+1,j} \quad \text{if } i > k_2,\ j > 0, \tag{13}$$

$$(\lambda_1 + \lambda_2 + \mu_i) P_{i,0} = \lambda_1 P_{i-1,0} + \lambda_2 P_{i-1,0} + \mu_{i+1} P_{i+1,0} \quad \text{if } 0 < i < k_2, \tag{14}$$

$$(\lambda_1 + \lambda_2 + \mu_{k_2}) P_{k_2,0} = (\lambda_1 + \lambda_2) P_{k_2-1,0} + \mu_{k_2} P_{k_2,1} + \mu_{k_2+1} P_{k_2+1,0}, \tag{15}$$

$$(\lambda_1 + \lambda_2 + \mu_i) P_{i,0} = \lambda_1 P_{i-1,0} + \mu_{i+1} P_{i+1,0} \quad \text{if } i > k_2, \tag{16}$$

$$(\lambda_1 + \lambda_2 + \mu_{k_2}) P_{k_2,j} = \lambda_2 P_{k_2,j-1} + \mu_{k_2} P_{k_2,j+1} + \mu_{k_2+1} P_{k_2+1,j} \quad \text{if } j > 0, \tag{17}$$

and

$$(\lambda_1 + \lambda_2) P_{0,0} = \mu_1 P_{1,0}. \tag{18}$$


It should be clear that $u_0$ (of Section III) is the same as $P_{0,0}$. So, one
can obtain $P_{i,0}$ for $0 \le i \le k_2$ by using (18) and (14). Even though the
coefficients of $P_{i,j}$ in eqs. (13) and (16) depend on $i$, it is easy to see
that these are constant coefficient partial difference equations when
$i \ge m$. Further, these equations are very similar to the ones solved by
Rege and Sengupta (Ref. 10). Using these methods, it is easy to show that


Pig = Boot (19) 
and 
Py = A ( DY of *Pyj-at Y ob "Pyj-a + Byo'] 
(o1 = 02) v=itl v=m 


for 1=mandj>0, (20) 


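where

$$\sigma_1 = \bigl(1 + \rho_1 + \rho_2 + \sqrt{(1 + \rho_1 + \rho_2)^2 - 4\rho_1}\,\bigr)/2,$$

$$\sigma_2 = \bigl(1 + \rho_1 + \rho_2 - \sqrt{(1 + \rho_1 + \rho_2)^2 - 4\rho_1}\,\bigr)/2,$$

$$\rho_1 = \lambda_1/\mu, \qquad \rho_2 = \lambda_2/\mu,$$

and $\{B_j, j = 0, 1, \ldots\}$ constitute a sequence of unknown constants to
be determined from the boundary conditions (15) and (17).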

Now we will show how to determine the unknown constants in two
steps. First, we will show this for $B_0$ and then for $B_j$. It is possible to
determine $B_0$ by assuming two trial values and using (19) and (16) to
recursively calculate two sets of $P_{i,0}$ for $i = m + 1, m, \ldots, k_2$. Since
each of these $P_{i,0}$ is a linear function of the unknown constant $B_0$, now
it is easy to use linear interpolation to obtain the correct value of $B_0$
that agrees with $P_{k_2,0}$ already obtained from (14). One can now use
(15) to obtain $P_{k_2,1}$. To calculate $B_j$ ($j > 0$), let us assume that $P_{k_2,j}$ has
been obtained from (15) for $j = 1$ or from (17) for $j > 1$. Further
assume that $P_{i,k}$ ($0 \le k \le j - 1$ and all $i$) are known. As before, we start
with two trial values of $B_j$ and recursively calculate two sets of $P_{i,j}$ for
$i = m + 1, \ldots, k_2$ by using (20) and (13). Since each of the $P_{i,j}$ is a
linear function of $B_j$, we can use linear interpolation to obtain the
correct value of $B_j$ that agrees with $P_{k_2,j}$ already obtained from (15)
for $j = 1$ or from (17) for $j > 1$. Finally, one can use (17) to obtain
$P_{k_2,j+1}$.
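
The two-trial-value step can be sketched generically. Since the recursively computed probability is a linear (affine) function of the unknown constant, two evaluations determine it exactly. In the sketch below, `residual` is a hypothetical stand-in for "value of $P_{k_2,0}$ produced by the downward recursion for a trial $B_0$, minus the value already known from (14)"; the affine example function is invented purely to exercise the code.

```python
def solve_affine_unknown(residual, trial_a=0.0, trial_b=1.0):
    """Return the root of an affine function from two trial evaluations.

    Exact when residual(B) = c0 + c1 * B with c1 != 0, which is the
    situation described for the constants B_0 and B_j.
    """
    r_a = residual(trial_a)
    r_b = residual(trial_b)
    if r_a == r_b:
        raise ValueError("residual does not depend on the unknown constant")
    # Linear interpolation (or extrapolation) to the zero of the residual.
    return trial_a - r_a * (trial_b - trial_a) / (r_b - r_a)


def mismatch(b0):
    """Hypothetical affine residual standing in for the recursion."""
    return 3.0 * b0 - 6.0


b0 = solve_affine_unknown(mismatch)
```

The same helper could be reused for each $B_j$ in turn, with the residual rebuilt from (20) and (13) once the earlier columns are known.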


APPENDIX B 
State Probability As Seen by Customers on Admission to the Service Area 


Here we describe the procedures for computing $Q_1(i, j)$ and $Q_2(i, j)$
that denote the state probabilities as seen upon admission to the
service area by type 1 and type 2 customers, respectively. The
procedure for computation of $Q_2(i, j)$ is described first.

It is clear that if, on arrival, a type 2 customer finds fewer than $k_2$
customers in the system, it does not have to wait before entering the
service area. So,

$$Q_2(i, j) = P(i - 1, j) \quad \text{for } i < k_2. \tag{21}$$

It is also obvious that $Q_2(i, j) = 0$ for $i > k_2$, since a type 2 customer
cannot enter the service area if the number of customers in the service
area other than itself is greater than or equal to $k_2$. A type 2 customer
will see $I = k_2$ upon its admission to the service area in one of two
ways: (1) if there are $k_2 - 1$ customers in the system just before its
arrival, or (2) if $I \ge k_2$ at the time of its arrival and it has to wait until
all type 2 customers ahead of it in the waiting area are admitted to
the system and a departure occurs from the state $I = k_2$.

From the analysis of Section III, $W_2(s)$ is the LST of the distribution
of the waiting time of a type 2 customer given that it has to wait. The
generating function of the number of arrivals of type 2 during this
wait is $W_2(\lambda_2(1 - z))$. Further, the probability that a type 2 customer
has to wait before entering the service area is $\sum_{i=k_2}^{\infty} u_i$.

Thus,

$$Q_2(k_2, j) = \delta_j P_{k_2-1,0} + \frac{1}{j!}\,\frac{d^j}{dz^j} W_2(\lambda_2(1 - z))\Big|_{z=0}\,\sum_{i=k_2}^{\infty} u_i, \tag{22}$$

where $\delta_j = 1$ if $j = 0$ and 0 otherwise.

To compute $Q_1(i, j)$, we note that type 1 customers do not have to
wait if there are fewer than $k_1$ customers in the service area at the time
of their arrival. Thus,

$$Q_1(i, j) = P(i - 1, j) \quad \text{for } i < k_1. \tag{23}$$

For $i > k_1$,


oy 
Aawgj=X% dL Pwss) 
=k, j’=0 
. * put) ge Ont) ny (dat)? 
0 (u’ — k,)! (i — ky)! 5 hee ie) 
For $i = k_1$, there are two possibilities: (1) if the type 1 customer
finds $k_1 - 1$ customers in the service area upon arrival and does not
have to wait, (2) if it has to wait but no type 1 arrivals occur during
its wait. Thus,


Qk jy=Pl-Liy+ & LY PWw,s’) 


edt. (24) 


=k, j’=0 

PMC cae NGL) Son 
‘ Ayre 8 ee edi, (25 
J (i’ — ky)! (py) ee 


In deriving (24) and (25) above, we have used the facts that the 
waiting time of a type 1 customer (given that it has to wait) has a 
gamma distribution and that the type 1 and type 2 arrivals that occur 
during this wait are independent and Poisson. 


APPENDIX C 


Characterization of the Time Spent in the Service Area by a Tagged 
Customer in a Two-Class Processor Sharing System 


Let $X(s)$ denote the LST of the distribution of the time spent in
the service area by an arbitrary “tagged customer.” Similarly, let $X_{i,j}(s)$
denote the LST of the conditional distribution of this random variable
given that the state $J$ of the system at the time the tagged customer
was admitted to the service area was $(i, j)$. (Here $i$ is assumed to
include the tagged customer.) Let $\mathbf{X}_i(s)$ denote the row vector with
entries

$$\{\mathbf{X}_i(s)\}_j = X_{i,j}(s) \quad \text{for } i \ge k_2,\ j \ge 0. \tag{26}$$

When $i < k_2$, there can be no type 2 customers waiting outside, that
is, $j = 0$. So we let $X_i(s)$ denote the quantity $X_{i,0}(s)$. We are interested
in determining the quantities $\mathbf{X}_i(s)$ for $i \ge k_2$ and $X_i(s)$ for $1 \le i < k_2$.

Assume that the tagged customer was admitted to the service area
at time 0 and let $T$ denote the time at which the tagged customer
finishes service and quits the system. Then,

$$\{\mathbf{X}_i(s)\}_j = E[e^{-sT} \mid J_{0^+} = (i, j)] \quad \text{for } i \ge k_2,\ j = 0, 1, \ldots,$$

and

$$X_i(s) = E[e^{-sT} \mid J_{0^+} = (i, 0)] \quad \text{for } 1 \le i \le k_2 - 1. \tag{27}$$

Let $T_i$ denote the first passage time (after time 0) into the state $(i, \cdot)$,
that is,

$$T_i = \min\{t : t \ge 0,\ J_t = (i, \cdot)\} \quad \text{for } i \ge 1. \tag{28}$$

Also, for $i \ge k_2$ and $j, k = 0, 1, 2, \ldots$, let ${}_iR_{jk}(s)$ denote the quantity

$$E[e^{-sT_{i-1}}\,1\{T > T_{i-1}\}\,1\{J_{T_{i-1}^+} = (i - 1, j)\} \mid J_{0^+} = (i, k)],$$

where $1\{A\}$ denotes the indicator function of the event $A$. Observe that
for $i \ge m$ the server continues to work at the maximum multiprogramming
level until the first passage time into the state $(i - 1, \cdot)$ so that,
sample path by sample path, the above expectation is independent of
$i$. Thus,

$${}_iR_{jk}(s) = R_{jk}(s) \quad \text{for } i \ge m. \tag{29}$$

Also, during this time no type 2 jobs are admitted to the service area
so that $j$ cannot be less than $k$, that is,

$$R_{jk}(s) = 0 \quad \text{for } j < k. \tag{30}$$

Let $R(s)$ denote the matrix with entries

$$\{R(s)\}_{jk} = R_{jk}(s) \quad \text{for } j, k = 0, 1, 2, \ldots. \tag{31}$$

Then $R(s)$, clearly, is lower triangular. Moreover, by a sample path
argument it can be shown that $R_{jk}(s)$ depends upon the difference,
$j - k$, which represents the number of type 2 arrivals during $(0, T_{i-1})$,
and not on $k$, which refers to the number of type 2 customers waiting
outside at time $0^+$. Thus $R(s)$ has the form

$$R(s) = \begin{bmatrix} r_0(s) & & & \\ r_1(s) & r_0(s) & & 0 \\ r_2(s) & r_1(s) & r_0(s) & \\ \vdots & & & \ddots \end{bmatrix}. \tag{32}$$


C.1 A matrix equation for R(s)


By conditioning on the first event to occur, we write $R_{jk}(s)$ as

$$R_{jk}(s) = \frac{\mu(m - 1)/m}{\Lambda + \mu + s}\,\delta_{jk} + \frac{\lambda_1}{\Lambda + \mu + s}\,E[e^{-sT_{i-1}}\,1\{T > T_{i-1}\}\,1\{J_{T_{i-1}^+} = (i - 1, j)\} \mid J_{0^+} = (i + 1, k)]$$
$$\qquad + \frac{\lambda_2}{\Lambda + \mu + s}\,E[e^{-sT_{i-1}}\,1\{T > T_{i-1}\}\,1\{J_{T_{i-1}^+} = (i - 1, j)\} \mid J_{0^+} = (i, k + 1)], \tag{33}$$

where the first term corresponds to the event “departure of a nontagged
customer,” the second to the event “arrival of a type 1 customer,” and
the third to the event “arrival of a type 2 customer.” Now the
expectation in the second term can be written as

$$E[e^{-sT_{i-1}}\,1\{T > T_{i-1}\}\,1\{J_{T_{i-1}^+} = (i - 1, j)\} \mid J_{0^+} = (i + 1, k)]$$
$$\qquad = \sum_{j'=0}^{\infty} E[e^{-s(T' + T'')}\,1\{T > T'\}\,1\{T - T' > T''\}\,1\{J_{T_{i-1}^+} = (i - 1, j)\}\,1\{J_{T'^+} = (i, j')\} \mid J_{0^+} = (i + 1, k)], \tag{34}$$

where $T'$ is the first passage time into the state $(i, \cdot)$ and $T''$ is the
time elapsed since the first passage into $(i, \cdot)$ until the first passage
into the state $(i - 1, \cdot)$. Clearly, $T_{i-1} = T' + T''$ since the Markov
process $J$ is skip-free. From the Markov property of $J_t$ it follows that
the right-hand side of (34) equals

$$\sum_{j'=0}^{\infty} E[e^{-sT''}\,1\{T - T' > T''\}\,1\{J_{(T'+T'')^+} = (i - 1, j)\} \mid J_{T'^+} = (i, j'),\ T > T'] \times E[e^{-sT'}\,1\{T > T'\}\,1\{J_{T'^+} = (i, j')\} \mid J_{0^+} = (i + 1, k)],$$

which is nothing but

$$\sum_{j'=0}^{\infty} R_{jj'}(s)\,R_{j'k}(s) = \{R^2(s)\}_{jk}.$$

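Thus, (33) can be written as

$$R_{jk}(s) = \frac{\mu(m - 1)/m}{\Lambda + \mu + s}\,\delta_{jk} + \frac{\lambda_1}{\Lambda + \mu + s}\,\{R^2(s)\}_{jk} + \frac{\lambda_2}{\Lambda + \mu + s}\,R_{j,k+1}(s) \tag{35}$$

or in a matrix form

$$R(s) = \frac{\mu(m - 1)/m}{\Lambda + \mu + s}\,I + \frac{\lambda_1}{\Lambda + \mu + s}\,R^2(s) + \frac{\lambda_2}{\Lambda + \mu + s}\,R(s)A. \tag{36}$$

In (35) and (36), $\delta_{jk}$ is the Kronecker $\delta$, that is, $\delta_{jk}$ equals 1 if $j = k$
and 0 otherwise; and the matrix $A$ is given by

$$A = \begin{bmatrix} 0 & & & \\ 1 & 0 & & 0 \\ 0 & 1 & 0 & \\ \vdots & & \ddots & \ddots \end{bmatrix}. \tag{37}$$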

The structure of $R(s)$ as given in (32) makes it possible to compute
the entries $r_i(s)$ recursively. $r_0(s)$ satisfies the quadratic equation

$$r_0(s) = \frac{\mu(m - 1)/m}{\Lambda + \mu + s} + \frac{\lambda_1}{\Lambda + \mu + s}\,r_0^2(s), \tag{38}$$

so that $r_0(s)$ is given by

$$r_0(s) = \frac{(\Lambda + \mu + s) \pm \sqrt{(\Lambda + \mu + s)^2 - 4\lambda_1\mu(m - 1)/m}}{2\lambda_1}. \tag{39}$$

Since, for $s \ge 0$, $|r_0(s)|$ must be less than or equal to 1, the larger of
the two roots is unacceptable. Thus,

$$r_0(s) = \frac{(\Lambda + \mu + s) - \sqrt{(\Lambda + \mu + s)^2 - 4\lambda_1\mu(m - 1)/m}}{2\lambda_1}. \tag{40}$$

Once $r_0(s)$ is known, $r_i(s)$, for $i \ge 1$, can be computed recursively since
$r_i(s)$ is expressible in terms of $r_0(s), \ldots, r_{i-1}(s)$. We note here that
$r_0(s)$ has a form similar to that of $\sigma_2(s)$ in Ref. 10. In fact, if $\lambda_2 = 0$,
then $r_0(s)$ and $\sigma_2(s)$ are identical and represent the same quantity.
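
One way to carry out this recursion is sketched below. It assumes that the entrywise form of the matrix equation for the Toeplitz matrix $R(s)$ reads $r_n = c_1 \sum_{i=0}^{n} r_i r_{n-i} + c_2 r_{n-1}$ for $n \ge 1$, with $c_1 = \lambda_1/(\Lambda + \mu + s)$, $c_2 = \lambda_2/(\Lambda + \mu + s)$ and $\Lambda = \lambda_1 + \lambda_2$; this is our reading of the relation between $R$, $R^2$ and the shifted matrix, and the function name is hypothetical. Solving for $r_n$ gives a divisor of $(\Lambda + \mu + s) - 2\lambda_1 r_0(s)$, which equals the square root appearing in the formula for $r_0(s)$.

```python
import math


def toeplitz_entries(s, lam1, lam2, mu, m, count):
    """First `count` entries r_0(s), r_1(s), ... of the Toeplitz matrix R(s)."""
    d = lam1 + lam2 + mu + s
    c = mu * (m - 1) / m
    root = math.sqrt(d * d - 4.0 * lam1 * c)
    r = [(d - root) / (2.0 * lam1)]  # smaller root of the quadratic for r_0
    for n in range(1, count):
        # Convolution terms that involve only already-known entries.
        cross = sum(r[i] * r[n - i] for i in range(1, n))
        r.append((lam1 * cross + lam2 * r[n - 1]) / root)
    return r


# Hypothetical parameter values chosen only to exercise the recursion.
s, lam1, lam2, mu, m = 0.5, 0.3, 0.2, 1.0, 4
r = toeplitz_entries(s, lam1, lam2, mu, m, count=60)

# Cross-check: the entries should sum to the smaller root of
# lam1 * x**2 - (lam1 + mu + s) * x + mu * (m - 1) / m = 0.
d1 = lam1 + mu + s
column_sum = (d1 - math.sqrt(d1 * d1 - 4.0 * mu * lam1 * (m - 1) / m)) / (2.0 * lam1)
```

The cross-check reflects the fact that summing the entrywise recursion over $n$ removes the type 2 arrival rate from the quadratic, leaving an equation in $\lambda_1$, $\mu$, $m$ and $s$ only.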


C.2 The structure of X;(s) 


Now that we have an explicit expression for $R(s)$, we shall attempt
to express the quantities $\mathbf{X}_i(s)$ in terms of $R(s)$.
The $k$th entry of $\mathbf{X}_i(s)$ can be written as

$$\{\mathbf{X}_i(s)\}_k = E[e^{-sT}\,1\{T > T_{i-1}\} \mid J_{0^+} = (i, k)] + E[e^{-sT}\,1\{T \le T_{i-1}\} \mid J_{0^+} = (i, k)]. \tag{41}$$

The first term on the right-hand side of (41) can be expanded as

$$E[e^{-sT}\,1\{T > T_{i-1}\} \mid J_{0^+} = (i, k)] = \sum_{j=0}^{\infty} E[e^{-sT}\,1\{T > T_{i-1}\}\,1\{J_{T_{i-1}^+} = (i - 1, j)\} \mid J_{0^+} = (i, k)], \tag{42}$$

which, because of the Markov property of $J$, reduces to

$$\sum_{j=0}^{\infty} E[e^{-s(T - T_{i-1})} \mid J_{T_{i-1}^+} = (i - 1, j),\ T > T_{i-1}]\;E[e^{-sT_{i-1}}\,1\{T > T_{i-1}\}\,1\{J_{T_{i-1}^+} = (i - 1, j)\} \mid J_{0^+} = (i, k)].$$

It follows from the memoryless property of the tagged customer’s
service-time distribution that $E[e^{-s(T - T_{i-1})} \mid J_{T_{i-1}^+} = (i - 1, j),\ T > T_{i-1}]$
equals $\{\mathbf{X}_{i-1}(s)\}_j$ and, for $i \ge m$,
$E[e^{-sT_{i-1}}\,1\{T > T_{i-1}\}\,1\{J_{T_{i-1}^+} = (i - 1, j)\} \mid J_{0^+} = (i, k)]$ equals $\{R(s)\}_{jk}$. Thus (42) reduces to

$$E[e^{-sT}\,1\{T > T_{i-1}\} \mid J_{0^+} = (i, k)] = \{\mathbf{X}_{i-1}(s)R(s)\}_k \quad \text{for } i \ge m. \tag{43}$$


To derive an expression for the second term in (41), we note that,
for $i \ge m$, no sample path that figures in the expectation
$E[e^{-sT}\,1\{T \le T_{i-1}\} \mid J_{0^+} = (i, k)]$ allows the multiprogramming level to
fall below $m$ before the departure of the tagged customer. Thus the
server continues to operate at rate $\mu$ until the tagged customer’s
departure. Also, for any two initial states $(i_1, k_1)$ and $(i_2, k_2)$, as long
as $i_1, i_2 \ge m$, there is a one-to-one correspondence between sample
paths describing the trajectory of $J_t$ between 0 and $T$, which make
equal contributions to the expectation. Thus,

$$E[e^{-sT}\,1\{T \le T_{i-1}\} \mid J_{0^+} = (i, k)] = \alpha(s) \quad \text{for } i \ge m, \tag{44}$$

where $\alpha(s)$ is independent of $i$ and $k$.
To derive an expression for $\alpha(s)$, it will be convenient to introduce
a quantity $\sigma(s)$ defined by

$$\sigma(s) = E[e^{-sT_{i-1}}\,1\{T > T_{i-1}\} \mid J_{0^+} = (i, k)] \quad \text{for } i \ge m,\ k = 0, 1, \ldots. \tag{45}$$

Note that although the definition of $\sigma(s)$ involves the initial state
$(i, k)$, $\sigma(s)$ is independent of the latter as long as $i \ge m$. Also note that

$$\sigma(s) = \sum_{k=0}^{\infty} r_k(s). \tag{46}$$
By conditioning on the first relevant event to occur, $\sigma(s)$ can be
written as

$$\sigma(s) = \frac{\mu(m - 1)/m}{\lambda_1 + \mu + s} + \frac{\lambda_1}{\lambda_1 + \mu + s}\,E[e^{-s(T_{i-1} - T_{i+1})}\,1\{T > T_{i-1}\} \mid J_{T_{i+1}^+} = (i + 1, \cdot),\ T > T_{i+1}]. \tag{47}$$

In (47) it can be seen that arrivals of type 2 customers are completely
ignored since they do not affect the mechanics involved. Since the
first passage time from the state $(i + 1, \cdot)$ to $(i - 1, \cdot)$ involves the
sum of the first passage times from $(i + 1, \cdot)$ to $(i, \cdot)$ and from $(i, \cdot)$
to $(i - 1, \cdot)$, which are i.i.d., (47) reduces to

$$\sigma(s) = \frac{\mu(m - 1)/m}{\lambda_1 + \mu + s} + \frac{\lambda_1}{\lambda_1 + \mu + s}\,\sigma^2(s). \tag{48}$$

Equation (48) is identical to the one describing $\sigma_2(s)$ in Ref. 10, with
$\lambda$ replaced by $\lambda_1$. The desired root of (48) is the smaller of the two
roots so that

$$\sigma(s) = \frac{(\lambda_1 + \mu + s) - \sqrt{(\lambda_1 + \mu + s)^2 - 4\mu\lambda_1(m - 1)/m}}{2\lambda_1}. \tag{49}$$


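Now $\alpha(s)$ can be obtained in a straightforward manner from $\sigma(s)$.
By conditioning on the first relevant event to occur, we write

$$\alpha(s) = \frac{\mu/m}{\mu + \lambda_1 + s} + \frac{\lambda_1}{\mu + \lambda_1 + s}\,E[e^{-s(T - T_{i+1})}\,1\{T \le T_{i-1}\} \mid J_{T_{i+1}^+} = (i + 1, \cdot),\ T > T_{i+1}]$$
$$\quad = \frac{\mu/m}{\mu + \lambda_1 + s} + \frac{\lambda_1}{\mu + \lambda_1 + s}\,\bigl(E[e^{-s(T - T_{i+1})}\,1\{T \le T_{i-1}\}\,1\{T \le T'\} \mid J_{T_{i+1}^+} = (i + 1, \cdot),\ T > T_{i+1}]$$
$$\qquad + E[e^{-s(T - T_{i+1})}\,1\{T \le T_{i-1}\}\,1\{T > T'\} \mid J_{T_{i+1}^+} = (i + 1, \cdot),\ T > T_{i+1}]\bigr), \tag{50}$$

where $T'$ denotes the first passage time into the state $(i, \cdot)$ after the
state has reached $(i + 1, \cdot)$ at time $T_{i+1}$. Following arguments similar
to the ones used earlier, it can be shown that (50) can be written as

$$\alpha(s) = \frac{\mu/m}{\mu + \lambda_1 + s} + \frac{\lambda_1}{\mu + \lambda_1 + s}\,[\alpha(s) + \sigma(s)\alpha(s)],$$

that is,

$$\alpha(s) = \frac{\mu/m}{\mu + s - \lambda_1\sigma(s)}. \tag{51}$$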
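The desired vector $\mathbf{X}_i(s)$ can now be written as

$$\mathbf{X}_i(s) = \boldsymbol{\alpha}(s) + \mathbf{X}_{i-1}(s)R(s) \quad \text{for } i \ge m, \tag{52}$$

where $\boldsymbol{\alpha}(s)$ is the row vector $(\alpha(s), \alpha(s), \alpha(s), \ldots)$.
If we introduce two more vectors $\mathbf{a}(s)$ and $\boldsymbol{\beta}(s)$, where

$$\boldsymbol{\beta}(s) = \frac{\mu/m}{\mu/m + s}\,\mathbf{1} \tag{53}$$

and

$$\mathbf{a}(s) = \boldsymbol{\alpha}(s) - \boldsymbol{\beta}(s) + \mathbf{X}_{m-1}(s)R(s), \tag{54}$$

the vectors $\mathbf{X}_i(s)$ for $i \ge m$ can be expressed as

$$\mathbf{X}_i(s) = \boldsymbol{\beta}(s) + \mathbf{a}(s)[R(s)]^{i-m}. \tag{55}$$

(The above equation can be proved by mathematical induction.)

It can be seen that as $i \to \infty$, the term $\mathbf{a}(s)[R(s)]^{i-m}$ vanishes, so
that $\mathbf{X}_i(s)$ approaches its limiting value $\boldsymbol{\beta}(s)$. In other words, when $i$
is large, the tagged customer receives its entire service at the rate
$\mu/m$ as expected.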
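
A numerical check of the limiting value is sketched below. It rests on an observation that is ours rather than the paper's: a vector with identical entries $b$ is left unchanged by the recursion $\mathbf{X}_i = \boldsymbol{\alpha} + \mathbf{X}_{i-1}R$ only if $b\,(1 - \sigma(s)) = \alpha(s)$, because each column of $R(s)$ sums to $\sigma(s)$. The function names and parameter values are hypothetical, and the closed forms used for $\sigma(s)$ and $\alpha(s)$ are the smaller-root and ratio forms assumed in this sketch.

```python
import math


def sigma_lst(s, lam1, mu, m):
    """Smaller root of lam1*x^2 - (lam1 + mu + s)*x + mu*(m-1)/m = 0."""
    d = lam1 + mu + s
    return (d - math.sqrt(d * d - 4.0 * mu * lam1 * (m - 1) / m)) / (2.0 * lam1)


def alpha_lst(s, lam1, mu, m):
    """alpha(s) = (mu/m) / (mu + s - lam1 * sigma(s))."""
    return (mu / m) / (mu + s - lam1 * sigma_lst(s, lam1, mu, m))


def beta_lst(s, mu, m):
    """LST of an exponential service time at the fully shared rate mu/m."""
    return (mu / m) / (mu / m + s)


s, lam1, mu, m = 0.5, 0.3, 1.0, 4  # hypothetical values
fixed_point = alpha_lst(s, lam1, mu, m) / (1.0 - sigma_lst(s, lam1, mu, m))
limit_value = beta_lst(s, mu, m)
```

The two numbers coincide, which is consistent with the interpretation that a customer admitted into a heavily loaded system is served at rate $\mu/m$ throughout.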


C.3 Boundary conditions 


For $i \ge m$, eq. (55) characterizes the vectors $\mathbf{X}_i(s)$ in terms of known
quantities $\boldsymbol{\beta}(s)$, $\boldsymbol{\alpha}(s)$, $R(s)$, and the unknown $\mathbf{X}_{m-1}(s)$. In other words,
if $\mathbf{X}_{m-1}(s)$ is known, $\mathbf{X}_i(s)$ can be obtained, for $i \ge m$, directly from
(55). To completely characterize the time spent in the service area by
a tagged customer, it remains to derive the boundary conditions, that
is, a system of equations from which the quantities $X_1(s), X_2(s),
\ldots, X_{k_2-1}(s), \mathbf{X}_{k_2}(s), \ldots, \mathbf{X}_{m-1}(s)$ can be obtained.
Without going into the details of derivation (it involves arguments
similar to the ones used in the earlier analysis), we state the boundary
conditions, which are as follows:


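$$\mathbf{X}_i(s) = \frac{\mu_i/i}{\Lambda + \mu_i + s}\,\mathbf{1} + \frac{(i - 1)\mu_i/i}{\Lambda + \mu_i + s}\,\mathbf{X}_{i-1}(s) + \frac{\lambda_1}{\Lambda + \mu_i + s}\,\mathbf{X}_{i+1}(s) + \frac{\lambda_2}{\Lambda + \mu_i + s}\,\mathbf{X}_i(s)A \quad \text{for } k_2 + 1 \le i \le m - 1, \tag{56}$$

$$\mathbf{X}_{k_2}(s) = \frac{\mu_{k_2}/k_2}{\Lambda + \mu_{k_2} + s}\,\mathbf{1} + \frac{(k_2 - 1)\mu_{k_2}/k_2}{\Lambda + \mu_{k_2} + s}\,\{\mathbf{X}_{k_2}(s)A^T + X_{k_2-1}(s)\,\mathbf{e}_0\} + \frac{\lambda_1}{\Lambda + \mu_{k_2} + s}\,\mathbf{X}_{k_2+1}(s) + \frac{\lambda_2}{\Lambda + \mu_{k_2} + s}\,\mathbf{X}_{k_2}(s)A, \tag{57}$$

$$X_{k_2-1}(s) = \frac{\mu_{k_2-1}/(k_2 - 1)}{\Lambda + \mu_{k_2-1} + s} + \frac{(k_2 - 2)\mu_{k_2-1}/(k_2 - 1)}{\Lambda + \mu_{k_2-1} + s}\,X_{k_2-2}(s) + \frac{\Lambda}{\Lambda + \mu_{k_2-1} + s}\,\{\mathbf{X}_{k_2}(s)\}_0, \tag{58}$$

$$X_i(s) = \frac{\mu_i/i}{\Lambda + \mu_i + s} + \frac{(i - 1)\mu_i/i}{\Lambda + \mu_i + s}\,X_{i-1}(s) + \frac{\Lambda}{\Lambda + \mu_i + s}\,X_{i+1}(s) \quad \text{for } 2 \le i < k_2 - 1, \tag{59}$$

and

$$X_1(s) = \frac{\mu_1}{\Lambda + \mu_1 + s} + \frac{\Lambda}{\Lambda + \mu_1 + s}\,X_2(s), \tag{60}$$

where $\mathbf{e}_0 = [1\ 0\ 0\ 0\ \cdots]$ and $\mathbf{1} = [1\ 1\ 1\ \cdots]$.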

Equations (55) through (60) give a characterization of the LST of
the distribution of the time spent in the service area by the tagged
customer given the state of the system at the time it was admitted to
the service area. To derive the mean time spent in the service area,
we need to differentiate these quantities at $s = 0$. Noting that
$\mathbf{a}(0) = \mathbf{0}$, we have

$$\mathbf{X}_i'(0) = \boldsymbol{\beta}'(0) + \mathbf{a}'(0)[R(0)]^{i-m} \quad \text{for } i \ge m. \tag{61}$$


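The boundary conditions also are obtained by differentiating the
corresponding equations at $s = 0$:

$$\mathbf{X}_i'(0) = -\frac{1}{\Lambda + \mu_i}\,\mathbf{1} + \frac{(i - 1)\mu_i/i}{\Lambda + \mu_i}\,\mathbf{X}_{i-1}'(0) + \frac{\lambda_1}{\Lambda + \mu_i}\,\mathbf{X}_{i+1}'(0) + \frac{\lambda_2}{\Lambda + \mu_i}\,\mathbf{X}_i'(0)A \quad \text{for } k_2 + 1 \le i \le m - 1, \tag{62}$$

$$\mathbf{X}_{k_2}'(0) = -\frac{1}{\Lambda + \mu_{k_2}}\,\mathbf{1} + \frac{(k_2 - 1)\mu_{k_2}/k_2}{\Lambda + \mu_{k_2}}\,[\mathbf{X}_{k_2}'(0)A^T + X_{k_2-1}'(0)\,\mathbf{e}_0] + \frac{\lambda_1}{\Lambda + \mu_{k_2}}\,\mathbf{X}_{k_2+1}'(0) + \frac{\lambda_2}{\Lambda + \mu_{k_2}}\,\mathbf{X}_{k_2}'(0)A, \tag{63}$$

$$X_{k_2-1}'(0) = -\frac{1}{\Lambda + \mu_{k_2-1}} + \frac{(k_2 - 2)\mu_{k_2-1}/(k_2 - 1)}{\Lambda + \mu_{k_2-1}}\,X_{k_2-2}'(0) + \frac{\Lambda}{\Lambda + \mu_{k_2-1}}\,\{\mathbf{X}_{k_2}'(0)\}_0, \tag{64}$$

$$X_i'(0) = -\frac{1}{\Lambda + \mu_i} + \frac{(i - 1)\mu_i/i}{\Lambda + \mu_i}\,X_{i-1}'(0) + \frac{\Lambda}{\Lambda + \mu_i}\,X_{i+1}'(0) \quad \text{for } 2 \le i < k_2 - 1, \tag{65}$$

and

$$X_1'(0) = -\frac{1}{\Lambda + \mu_1} + \frac{\Lambda}{\Lambda + \mu_1}\,X_2'(0). \tag{66}$$


Inequalities satisfied by the Laplace transforms of convex and log-convex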
functions are obtained. Applications are made to the M/G/1 queue waiting 
time and to an important teletraffic congestion problem, arising in parcel 
blocking studies. 


I. INTRODUCTION


The purpose of this paper is to establish certain inequalities satisfied
by the Laplace transform of convex functions and to illustrate their
use. The notion of a-convexity on which these results are based is
fully discussed.¹ This notion had its origin in the author’s
investigations concerning the inversion of the Laplace transform;²
subsequently, it has been applied to obtaining the results reported on here.
The property of a-convexity forms a natural bridge between ordinary
convexity (a = 0) and the stronger property of log-convexity (all a).
This enables the formulation of a criterion in terms of the transform
of a function for ascertaining the log-convexity of the function, vd.
Theorem 2 and (11).

An infinite set of inequalities satisfied by the Laplace transform of
a log-convex function is obtained, vd. Theorem 3; these inequalities
are illustrated by two applications. The first provides necessary
conditions for the log-convexity of the complementary waiting time
distribution in the First-In First-Out (FIFO) M/G/1 queue. The
conditions are expressed in terms of the transform of the complementary
service time distribution.

The second application concerns the important teletraffic problem 
of ascertaining the time congestion of a call that overflows a primary 
group and is offered to a secondary group. Simple upper and lower 
bounds are obtained for a function [δ_j(x, a)] arising in Brockmeyer’s
analysis of the problem and in terms of which the time congestion is 
obtained.® These results form an important part of “parcel blocking” 
analyses.* 


II. α-CONVEXITY


A function f(x) is said to be α-convex on an interval I if e^{αx}f(x) is
convex on I. Clearly, ordinary convexity corresponds to α = 0. A
sufficient condition for convexity of f(x) is°

$$f''(x) \ge 0, \qquad x \in I. \tag{1}$$

Introduce the function h(x) by

$$h(x) = e^{-\alpha x}\,\frac{d^2}{dx^2}\left[e^{\alpha x} f(x)\right] \tag{2}$$

$$\quad\; = f''(x) + 2\alpha f'(x) + \alpha^2 f(x). \tag{3}$$

Then, in view of (1), the condition for α-convexity is

$$h(x) \ge 0, \qquad x \in I. \tag{4}$$

It should be observed that α-convexity does not imply convexity
(α = 0). Thus consider, for example, f(x) = x³, which is α-convex for
x ≥ 0, α = 0. For α = 1, however, h(x) = x(x² + 6x + 6), so that x³ is
also α-convex for −3 − √3 ≤ x ≤ −3 + √3, where it is not convex.
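
The x³ example can be checked numerically. The following is a minimal sketch, not taken from the paper; the names `h_cubic` and `is_alpha_convex_at` are illustrative assumptions. It evaluates h(x) of (3) for f(x) = x³ and tests the sign condition (4) pointwise.

```python
import math


def h_cubic(x: float, alpha: float) -> float:
    """h(x) from (3) for f(x) = x**3: f'' + 2*alpha*f' + alpha**2*f."""
    f = x ** 3
    f1 = 3.0 * x ** 2   # f'(x)
    f2 = 6.0 * x        # f''(x)
    return f2 + 2.0 * alpha * f1 + alpha ** 2 * f


def is_alpha_convex_at(x: float, alpha: float, tol: float = 1e-9) -> bool:
    """Pointwise form of condition (4): h(x) >= 0, up to rounding."""
    return h_cubic(x, alpha) >= -tol


# End points of the interval quoted in the text for alpha = 1.
LEFT = -3.0 - math.sqrt(3.0)
RIGHT = -3.0 + math.sqrt(3.0)

if __name__ == "__main__":
    for x in (LEFT, -3.0, RIGHT, -0.5, 1.0):
        print(f"x = {x:+.4f}  h(x; alpha=1) = {h_cubic(x, 1.0):+.4f}")
```

At x = −3 the function is not convex (α = 0) but is α-convex for α = 1, while at x = −0.5, between the interval and the origin, condition (4) fails for α = 1.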

The α-convexity of a function may permit stronger bounds to be
obtained on integrals of the function than ordinary convexity. For
example, let p(x) ≥ 0, and let f(x) be convex on I; then Jensen’s
inequality states®

$$\int_I f(x)p(x)\,dx \ge f(\mu)\int_I p(x)\,dx, \qquad \mu = \int_I x\,p(x)\,dx \Big/ \int_I p(x)\,dx. \tag{5}$$

If f(x) is α-convex on I, then, since

$$\int_I f(x)p(x)\,dx = \int_I e^{\alpha x}f(x)\,e^{-\alpha x}p(x)\,dx, \tag{6}$$

one has

$$\int_I f(x)p(x)\,dx \ge e^{\alpha\mu}f(\mu)\int_I e^{-\alpha x}p(x)\,dx, \qquad \mu = \int_I x\,e^{-\alpha x}p(x)\,dx \Big/ \int_I e^{-\alpha x}p(x)\,dx. \tag{7}$$

This result can be stronger than (5).
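Let the Laplace transform, f̃(s), of f(x) be defined by

$$\tilde f(s) = \int_0^\infty e^{-sx}f(x)\,dx, \qquad s > -\gamma, \tag{8}$$

and let f(x) be α-convex for x ≥ 0; then the following theorem may be
stated.

Theorem 1:

$$\tilde f(s) \ge \frac{1}{s+\alpha}\,e^{\alpha/(s+\alpha)}\,f\!\left(\frac{1}{s+\alpha}\right), \qquad s > \max(-\gamma, -\alpha),$$

or, equivalently,

$$f(x) \le \frac{1}{x}\,e^{-\alpha x}\,\tilde f\!\left(\frac{1}{x}-\alpha\right), \qquad \alpha < \frac{1}{x}+\gamma.$$

Proof: Use of (7) with p(x) = e^{−sx}.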
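
As an illustration of Theorem 1, the sketch below takes the test function f(x) = e^{−x}, whose transform is 1/(s + 1), and compares the transform with the right-hand side of the first form of the theorem. The choice of test function and the names `f_tilde` and `theorem1_bound` are assumptions made for this example only.

```python
import math


def f(x: float) -> float:
    """Test function exp(-x); it is log-convex, hence alpha-convex for all alpha."""
    return math.exp(-x)


def f_tilde(s: float) -> float:
    """Laplace transform (8) of exp(-x): 1/(s + 1), valid for s > -1."""
    return 1.0 / (s + 1.0)


def theorem1_bound(s: float, alpha: float) -> float:
    """Right-hand side of Theorem 1 (first form); requires s + alpha > 0."""
    u = s + alpha
    if u <= 0.0:
        raise ValueError("need s + alpha > 0")
    return math.exp(alpha / u) * f(1.0 / u) / u


if __name__ == "__main__":
    s = 1.0
    for alpha in (-0.2, 0.0, 0.5, 1.0, 2.0):
        print(alpha, theorem1_bound(s, alpha), "<=", f_tilde(s))
```

For this test function the bound is attained at α = 1 and is strict for other values of α, which shows how the free parameter α can be used to tighten the Jensen-type bound.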
A function f(x) > 0 is said to be log-convex on I if ln f(x) is convex
on I. Thus, the condition for log-convexity is

$$f''(x)f(x) - f'(x)^2 \ge 0, \qquad x \in I. \tag{9}$$

In particular, log-convexity implies convexity; since e^{αx}f(x) is
log-convex whenever f(x) is, e^{αx}f(x) is convex. Thus, a log-convex
function is α-convex for all α. The following theorem asserts also the
converse.
Theorem 2: A function f(x) > 0 is log-convex on an interval I if and
only if it is α-convex on I for all α.
Proof: It is necessary to prove only that f(x) is log-convex on I if it is
α-convex on I for all α. This follows from (3) on observing that the
discriminant of the quadratic in α is f′(x)² − f″(x)f(x); hence,
α-convexity for all α implies (9) and the consequent log-convexity of
f(x).
Corollary: The sum of log-convex functions is log-convex.
Proof: The sum of α-convex functions corresponding to the same α is
clearly again α-convex for the same α; hence, the statement follows
on applying the theorem.

A function f(x) is said to be completely monotone on I if

$$(-1)^r f^{(r)}(x) \ge 0, \qquad x \in I, \quad r = 0, 1, 2, \cdots. \tag{10}$$

The Bernstein theorem,® which states that f(x) ≥ 0 if and only if
f̃(s) is completely monotone for s real and in the domain of convergence
of (8), may be used to translate condition (4) in terms of f̃(s).
Accordingly, let f̃(s) converge for s > 0 and let

$$\tilde h(s) = (s+\alpha)^2\tilde f(s) - (s+2\alpha)f(0+) - f'(0+), \qquad s > 0. \tag{11}$$

Then f(x) is α-convex if and only if h̃(s) is completely monotone for
s > 0. Thus, also, f(x) is log-convex if and only if h̃(s) is completely
monotone for s > 0 and all α.


III. INEQUALITIES FOR f̃(s) FROM LOG-CONVEXITY
It will now be assumed that f(x) is log-convex for x > 0 and that
f̃(s) converges for s > 0. Thus, one has

$$\min_{\alpha}\,(-1)^n\tilde h^{(n)}(s) \ge 0, \qquad s > 0, \quad n = 0, 1, 2, \cdots. \tag{12}$$


The following theorem will now be proved. 
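Theorem 3: If f(x) is log-convex for x > 0, then for all s > 0, one has

$$\tilde f(s)^{-1} \le \frac{s f(0+) - f'(0+)}{f(0+)^2}, \qquad f(0+) \ne 0,$$

$$\frac{d}{ds}\,\tilde f(s)^{-1} \ge f(0+)^{-1}, \qquad f(0+) \ne 0,$$

$$(n-1)\,\tilde f^{(n-2)}(s)\,\tilde f^{(n)}(s) \ge n\,\tilde f^{(n-1)}(s)^2, \qquad n \ge 2.$$

The equality signs are achieved for f(x) = e^{−ax}.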
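Proof: One has for

$$\alpha = \frac{f(0+)}{\tilde f(s)} - s, \tag{13}$$

$$\min_{\alpha}\tilde h(s) = s f(0+) - f'(0+) - \frac{f(0+)^2}{\tilde f(s)} \ge 0. \tag{14}$$

This establishes the first inequality. For

$$\alpha = -\frac{\tilde f(s)}{\tilde f'(s)} - s, \tag{15}$$

one has

$$\min_{\alpha}\left[-\tilde h'(s)\right] = \frac{\tilde f(s)^2}{\tilde f'(s)} + f(0+) \ge 0, \tag{16}$$

which, after a little manipulation, yields the second inequality. For
the choice

$$\alpha = -n\,\frac{\tilde f^{(n-1)}(s)}{\tilde f^{(n)}(s)} - s, \tag{17}$$

direct calculation shows that

$$\min_{\alpha}\left[(-1)^n\tilde h^{(n)}(s)\right] = (-1)^n n\left[(n-1)\tilde f^{(n-2)}(s) - n\,\frac{\tilde f^{(n-1)}(s)^2}{\tilde f^{(n)}(s)}\right] \ge 0. \tag{18}$$

In view of the complete monotonicity of f̃(s), the remaining inequalities
are established.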
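
The inequalities of Theorem 3 can be spot-checked on a mixture of decaying exponentials, which is log-convex (compare the Corollary to Theorem 2) and has a transform with closed-form derivatives. This is a minimal sketch under that assumption; the function names and the particular weights and rates used in the demonstration are illustrative, not taken from the paper.

```python
import math


def transform_derivative(s, k, weights, rates):
    """k-th derivative in s of the transform of f(x) = sum_i w_i*exp(-r_i*x)."""
    return sum(
        w * (-1) ** k * math.factorial(k) / (s + r) ** (k + 1)
        for w, r in zip(weights, rates)
    )


def theorem3_gaps(s, weights, rates, n_max=5):
    """Larger side minus smaller side for each inequality of Theorem 3.

    Every entry should be non-negative for a log-convex f.
    """
    f0 = sum(weights)                                   # f(0+)
    fp0 = -sum(w * r for w, r in zip(weights, rates))   # f'(0+)

    def ft(k):
        return transform_derivative(s, k, weights, rates)

    gaps = [
        (s * f0 - fp0) / f0 ** 2 - 1.0 / ft(0),   # first inequality
        -ft(1) / ft(0) ** 2 - 1.0 / f0,           # second: d/ds[1/f~(s)] - 1/f(0+)
    ]
    for n in range(2, n_max + 1):                 # third family, n >= 2
        gaps.append((n - 1) * ft(n - 2) * ft(n) - n * ft(n - 1) ** 2)
    return gaps


if __name__ == "__main__":
    print(theorem3_gaps(1.0, weights=(0.5, 0.5), rates=(1.0, 3.0)))
    print(theorem3_gaps(1.0, weights=(1.0,), rates=(2.0,)))  # single exponential
```

For a single exponential every gap vanishes to rounding error, in line with the statement that the equality signs are achieved in that case.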


IV. CONVEXITY IN M/G/1 


An application of Theorem 3 will now be made to waiting time in 
the FIFO M/G/1 queue.’ In the following, λ is the arrival rate, μ is
the service rate, ρ = λ/μ < 1, μ₂ is the second moment about the origin
of service time, and β̃(s), F̃(s) are the Laplace transforms of the
complementary service and waiting time distributions, respectively.
Theorem 4 may now be stated. 

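Theorem 4: Necessary conditions for the complementary waiting time
distribution, F(x), to be log-convex are

$$\frac{2}{\mu^2\mu_2}\cdot\frac{1}{s + \dfrac{2}{\mu\mu_2}} \le \tilde\beta(s) \le \frac{1}{s+\mu},$$

$$\mu_2 \ge \frac{2}{\mu^2}.$$
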
Proof: The Pollaczek-Khintchine formula’ may be written in the form

$$\tilde F(s) = \frac{1}{s}\cdot\frac{\rho - \lambda\tilde\beta(s)}{1 - \lambda\tilde\beta(s)}. \tag{19}$$

Clearly,

$$\tilde\beta(s) \sim \frac{1}{s}, \qquad s \to \infty; \tag{20}$$

hence,

$$\tilde F(s) \sim \frac{\rho}{s} - \frac{\lambda(1-\rho)}{s^2}, \qquad s \to \infty. \tag{21}$$

Thus,

$$F(0+) = \rho, \qquad F'(0+) = -\lambda(1-\rho). \tag{22}$$

Application of the first inequality of Theorem 3 yields

$$s\,\frac{1 - \lambda\tilde\beta(s)}{\rho - \lambda\tilde\beta(s)} \le \frac{s}{\rho} + \frac{1-\rho}{\rho}\,\mu. \tag{23}$$

Observing that β̃(s) is monotone decreasing, and that λβ̃(0) = ρ < 1,
one has (s > 0)

$$1 - \lambda\tilde\beta(s) > 0, \qquad \rho - \lambda\tilde\beta(s) > 0. \tag{24}$$

Hence, multiplying (23) through by ρ − λβ̃(s), and solving for β̃(s),
one obtains

$$\tilde\beta(s) \le \frac{1}{s+\mu}. \tag{25}$$

Using the second inequality of Theorem 3 in the form

$$\tilde F(s)^{-1} \ge \tilde F(0)^{-1} + \frac{s}{F(0+)} \tag{26}$$

with the evaluation

$$\tilde F(0) = \frac{\lambda\mu_2}{2(1-\rho)} \tag{27}$$

yields the required lower bound for β̃(s). Finally, multiplying the upper
and lower bounds by s and evaluating the limit s → ∞ provides the
last inequality of the theorem.
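
The necessary conditions of Theorem 4 are easy to evaluate for a given service law. The sketch below assumes a two-phase hyperexponential service time, whose complementary distribution is p e^{−r₁x} + (1 − p) e^{−r₂x}; that choice, and all function names, are assumptions made for illustration only. The bounds are coded in the form derived from (23) to (27).

```python
def beta_tilde(s, p, r1, r2):
    """Transform of the complementary service distribution
    p*exp(-r1*x) + (1 - p)*exp(-r2*x)."""
    return p / (s + r1) + (1.0 - p) / (s + r2)


def service_moments(p, r1, r2):
    """Return (mu, mu2): service rate and second moment about the origin."""
    mean = p / r1 + (1.0 - p) / r2
    second = 2.0 * (p / r1 ** 2 + (1.0 - p) / r2 ** 2)
    return 1.0 / mean, second


def theorem4_bounds(s, mu, mu2):
    """Lower and upper bounds on beta~(s) required by Theorem 4."""
    lower = (2.0 / (mu ** 2 * mu2)) / (s + 2.0 / (mu * mu2))
    upper = 1.0 / (s + mu)
    return lower, upper


def moment_condition(mu, mu2):
    """Last condition of Theorem 4: mu2 >= 2/mu**2."""
    return mu2 >= 2.0 / mu ** 2


if __name__ == "__main__":
    p, r1, r2 = 0.5, 1.0, 3.0
    mu, mu2 = service_moments(p, r1, r2)
    for s in (0.1, 1.0, 10.0):
        lo, hi = theorem4_bounds(s, mu, mu2)
        print(s, lo, beta_tilde(s, p, r1, r2), hi)
    print(moment_condition(mu, mu2))
```

Since the conditions are only necessary, passing them does not establish log-convexity; failing one rules it out. For example, a service time with μ = 1 and μ₂ = 1.5 (the moments of a two-stage Erlang law) violates the last condition.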


V. AN OVERFLOW MODEL 

Consider the traffic model of Fig. 1, in which a is the offered load,
assumed Poisson, to the primary trunk group of x trunks whose
overflow of m erlangs and peakedness z is offered to the secondary
trunk group of c trunks. Clearly,

$$m = aB(x, a), \tag{28}$$

in which B(x, a) is the Erlang loss function.”® The blocking (call
congestion) experienced by a call overflowing the primary is given by
the formula (equivalent random method)®

$$B_c = \frac{B(x + c, a)}{B(x, a)}. \tag{29}$$

An alternative expression for B_c may be obtained from the Brockmeyer
analysis.*> Let P(k) be the probability k trunks are busy on the
secondary group and let m(k) be the corresponding load offered to the
secondary; then

$$m = \sum_{k=0}^{c} m(k)P(k), \tag{30}$$

$$B_c = \frac{m(c)P(c)}{m}. \tag{31}$$

Fig. 1—Overflow system. [Diagram labels: PRIMARY, OVERFLOW, SECONDARY.]


One has also that the time congestion, B_T, is given by

$$B_T = P(c). \tag{32}$$

In order to obtain P(c), the function δ_j(x, a) is introduced by

$$\delta_j(x, a) = \frac{G_x(-j-1, a)}{G_x(-j, a)}, \tag{33}$$

in which G_j(x, a) are Poisson-Charlier polynomials.’ Then

$$B_T = \delta_c(x, a)\,B(x + c, a). \tag{34}$$


It has been found‘ that δ_j(x, a) satisfies the recursion

$$\delta_0(x, a) = B(x, a)^{-1},$$

$$\delta_j(x, a) = 1 + \frac{x - a}{j} + \frac{a}{j}\,\delta_{j-1}(x, a)^{-1}, \qquad j \ge 1. \tag{35}$$


Another recursion for δ_j(x, a) may be obtained by considering the
integral

$$I_j(x, a) = \int_0^\infty e^{-ay}(1+y)^x y^j\,dy. \tag{36}$$

Since

$$G_x(-j, a) = (-1)^x\,\frac{a^j}{(j-1)!}\int_0^\infty e^{-ay}(1+y)^x y^{j-1}\,dy, \tag{37}$$

one has from (33)

$$\delta_j(0, a) = 1, \qquad j \ge 0,$$

$$\delta_j(x, a) = \frac{a}{j}\cdot\frac{I_j(x, a)}{I_{j-1}(x, a)}, \qquad j \ge 1. \tag{38}$$

One easily establishes

$$I_j(x + 1, a) = I_j(x, a) + I_{j+1}(x, a), \tag{39}$$

$$I_j(x, a) = \frac{x}{a}\,I_j(x - 1, a) + \frac{j}{a}\,I_{j-1}(x, a); \tag{40}$$

hence, from (38) and (40)

$$\delta_j(x, a) = \frac{x}{j}\cdot\frac{I_j(x - 1, a)}{I_{j-1}(x, a)} + 1. \tag{41}$$

Use of (39) finally yields

$$\delta_j(x, a) = \frac{x}{a + j\,\delta_j(x - 1, a)}\,\delta_j(x - 1, a) + 1, \qquad x \ge 1. \tag{42}$$
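
Recursion (42), started from δ_j(0, a) = 1 in (38), can be sketched as follows for integer x and a > 0. The function names are illustrative assumptions; the Erlang loss function is computed by its standard recursion, and the two congestion formulas are (29) and (34).

```python
def erlang_b(x: int, a: float) -> float:
    """Erlang loss B(x, a) for integer x >= 0 and a > 0 (standard recursion)."""
    inv_b = 1.0
    for k in range(1, x + 1):
        inv_b = 1.0 + (k / a) * inv_b
    return 1.0 / inv_b


def delta(j: int, x: int, a: float) -> float:
    """delta_j(x, a) from recursion (42) with delta_j(0, a) = 1, see (38)."""
    d = 1.0
    for k in range(1, x + 1):
        d = k * d / (a + j * d) + 1.0
    return d


def call_congestion(c: int, x: int, a: float) -> float:
    """B_c of (29): blocking seen by calls overflowing the primary group."""
    return erlang_b(x + c, a) / erlang_b(x, a)


def time_congestion(c: int, x: int, a: float) -> float:
    """B_T of (34): probability that all c secondary trunks are busy."""
    return delta(c, x, a) * erlang_b(x + c, a)


if __name__ == "__main__":
    x, c, a = 5, 3, 4.0
    print("B_c =", call_congestion(c, x, a))
    print("B_T =", time_congestion(c, x, a))
```

For j = 0 the recursion reduces to the usual one for 1/B(x, a), consistent with δ₀(x, a) = B(x, a)⁻¹ in (35).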

The recursion of (42) with initial value of (38) provides a convenient
and stable method for the exact computation of δ_j(x, a); however, in
many practical investigations, it is useful to have upper and lower
bounds showing simple and explicit dependence on the arguments.
For this purpose, let P_x(t) be the recovery function for a group of x
trunks, that is, the probability that all trunks are busy at time t given
they were all busy at time zero; then®

$$\tilde P_x(s) = \frac{G_x(-s, a)}{s\,G_x(-s-1, a)}. \tag{43}$$

Comparison of (43) with (33) shows that

$$\tilde P_x(j) = \frac{1}{j\,\delta_j(x, a)}. \tag{44}$$

It is also known that P_x(t) is log-convex in t for t ≥ 0.° The following
theorem may now be proved.


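Theorem 5: For j ≥ 1, x ≥ 0, a ≥ 0, the following inequalities hold

$$\frac{1}{2}\left[1 + \frac{x-a+1}{j} + \sqrt{\left(1 + \frac{x-a+1}{j}\right)^2 + 4\,\frac{j(a-1)-x}{j^2}}\,\right] \le \delta_j(x, a) \le \frac{1}{2}\left[1 + \frac{x-a}{j} + \sqrt{\left(1 + \frac{x-a}{j}\right)^2 + \frac{4a}{j}}\,\right].$$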
Proof: The upper bound is due to A. A. Fredericks,* who shows that

$$\delta_j(x, a) \le \delta_{j-1}(x, a), \qquad j \ge 1; \tag{45}$$

hence, from (35) one gets

$$\delta_j(x, a) \le 1 + \frac{x-a}{j} + \frac{a}{j}\,\delta_j(x, a)^{-1}. \tag{46}$$

Solution of the quadratic provides the upper bound of the theorem.

Since P_x(0) = 1, the second inequality of Theorem 3 applied to
P̃_x(s) in the form

$$\tilde P_x(s+h)^{-1} - \tilde P_x(s)^{-1} \ge h, \qquad h \ge 0, \tag{47}$$

yields, from (44),

$$(j+h)\,\delta_{j+h}(x, a) - j\,\delta_j(x, a) \ge h, \qquad h \ge 0. \tag{48}$$

Setting h = 1 and writing (48) in the form

$$\delta_{j-1}(x, a)^{-1} \ge \frac{j-1}{j\,\delta_j(x, a) - 1} \tag{49}$$

yields, after substitution into (35),

$$\delta_j(x, a) \ge 1 + \frac{x-a}{j} + \frac{a}{j}\cdot\frac{j-1}{j\,\delta_j(x, a) - 1}. \tag{50}$$

From (48) it follows that jδ_j(x, a) ≥ 1; hence, in (50), multiplying
through by jδ_j(x, a) − 1 yields the inequality

$$\delta_j(x, a)^2 - \left(1 + \frac{x-a+1}{j}\right)\delta_j(x, a) - \frac{j(a-1)-x}{j^2} \ge 0. \tag{51}$$

Solution of this quadratic finally yields the lower bound of the theorem.
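
The bounds of Theorem 5 can be compared with the exact value from recursion (42). The following is a minimal sketch for integer x; the names `delta` and `theorem5_bounds` are assumptions for this example, and the bounds are coded as the larger roots of the quadratics obtained from (46) and (51).

```python
import math


def delta(j: int, x: int, a: float) -> float:
    """Exact delta_j(x, a) from recursion (42) with delta_j(0, a) = 1."""
    d = 1.0
    for k in range(1, x + 1):
        d = k * d / (a + j * d) + 1.0
    return d


def theorem5_bounds(j: int, x: int, a: float):
    """Return (lower, upper) bounds on delta_j(x, a) for j >= 1."""
    # Lower bound: larger root of the quadratic in (51).
    b_lo = 1.0 + (x - a + 1.0) / j
    c_lo = (j * (a - 1.0) - x) / j ** 2
    disc_lo = max(b_lo ** 2 + 4.0 * c_lo, 0.0)   # guard against rounding
    lower = 0.5 * (b_lo + math.sqrt(disc_lo))
    # Upper bound: larger root of the quadratic implied by (46).
    b_up = 1.0 + (x - a) / j
    upper = 0.5 * (b_up + math.sqrt(b_up ** 2 + 4.0 * a / j))
    return lower, upper


if __name__ == "__main__":
    for j, x, a in ((1, 5, 10.0), (2, 2, 1.0), (3, 10, 2.0)):
        lo, hi = theorem5_bounds(j, x, a)
        print(j, x, a, lo, delta(j, x, a), hi)
```

Both bounds reduce to 1 at x = 0, where δ_j(0, a) = 1, so the theorem is sharp there.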